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DISEASE RELATED PROTEIN NETWORK 

The present invention relates to a method for generating a network of direct and 
indirect interaction partners of a disease-related (polypeptide comprising the steps of 
(a) contacting a selection of (polypeptides suspected to contain one or several of 
said direct or indirect interaction partners with said disease-related (poly)peptides 
and optionally with known direct or indirect interaction partners of said disease- 
related (polypeptide under conditions that allow the interaction between interaction 
partners to occur; (b) detecting (polypeptides that interact with said disease-related 
(polypeptide or with said known direct or indirect interaction partners of said 
disease-related (polypeptide; (c) contacting (polypeptides detected in step (b) with a 
selection of (polypeptides suspected to contain one or several (polypeptides 
.nteracting with said (polypeptides detected in step (b) under conditions that allow 
the .nteraction between interaction partners to occur; (d) detecting proteins that 
•nteract with said (polypeptides detected in step (b); (e) contacting said disease- 
related (polypeptide and optionally said known direct or indirect interaction partners 
of said disease-related (polypeptide, said (polypeptides detected in steps (b) and 
(d) and a selection of proteins suspected to contain one or several (polypeptides 
interacting with any of the afore mentioned (polypeptides under conditions that allow 
the interaction between interaction partners to occur; (f) detecting (polypeptides that 
.nteract with said disease-related (polypeptide and optionally said known direct or 
md.rect interaction partners of said disease-related (polypeptide or with said 
(polypeptides identified in step (b) or (d); and (g) generating a (polypeptide - 
(polypeptide interaction network of said disease-related (polypeptide and optionally 
sa.d known direct or indirect interaction partners of said disease-related (poly)peptide 
and said (polypeptides identified in steps (b), (d) and (f). Moreover, the present 
invention relates to a protein complex comprising at least two proteins and to 
methods for identifying compounds interfering with an interaction of said proteins 
Finally, the present invention relates to a pharmaceutical composition and to the use 
of compounds identified by the present invention for the preparation of a 
pharmaceutical composition for the treatment of Huntington's disease. 



Several documents are cited throughout the text of this specification. The disclosure 
content of the documents cited herein (including any manufacture's specifications, 
instructions, etc.) is herewith incorporated by reference. 

With the identification of >35.000 genes in the human genome the challenge arises to 
assign biological function to all proteins and to link these proteins to physiological 
pathways and disease processes. Since protein-protein interactions play a role in 
most events in a cell, clues to the function of an unknown protein can be obtained by 
investigating its interaction with other proteins whose function are already known. 
Thus, if the function of one protein is known, the function of the binding parners can 
be infered (deduced). This allows the researcher to assign a biological function to 
uncharacterized proteins by identifying protein-protein interactions. For example, 
several so far uncharacterized proteins in Caenorhabditis elegans were identified in a 
yeast two-hybrid screen for eukaryotic 26S proteasome interacting proteins and 
thereby could be linked to the ubiquitin-proteasome proteolytic pathway (Vidal et al., 
2001). Elucidation of protein-protein interactions is particularly desired when it comes 
to the generation of new drugs. For many diseases, the available drug portfolio is 
insufficient or inappropriate to provide a cure or to prevent onset of the disease. One 
such disease is Huntington's disease. 

Huntington's disease (HD) is a neurodegenerative disorder caused by an expanded 
polyglutamine (polyQ) tract in the multidomain protein huntingtin (htt). The elongated 
polyQ sequence is believed to confer a toxic gain of function to htt. It leads to htt 
aggregation primarily in neurons of the striatum and cortex and subsequently to the 
appearance of the disease phenotype. However, there is experimental evidence that 
loss of htt function may also contribute to HD pathogenesis. Since huntingtin 
aggregation correlates with disease progression, it is crucial to develop methods for 
identifying factors that promote or inhibit aggregation of huntingtin. 

Previously, a number of single interaction partners of huntingtin had been reported. 
In light of these reports, it is tempting to speculate that huntingtin is bound into a 
larger network of interacting partners, many of which might be capable of modulating 
huntingtin's activity and function by direct or indirect interaction. It is likely that an 



aberrant interaction of huntingtin with some of the members of said network will 
.mpa,r huntingtin's normal function. Moreover, this interaction might also be relevant 
for the conformation of huntingtin or for its solubility or state of aggregation 
interfering with the direct or indirect interactions of the protein-protein interaction 
network will provide an excellent basis for therapeutic intervention as it will allow to 
modulate huntingtin's activity or state of aggregation or both. The state of the art so 
far d.d not provide compounds capable of reducing or suppressing huntingtin 
aggregation since the factors promoting or suppressing huntingtin aggregation were 
not known. 

Thus, the technical problem underlying the present invention was to provide novel 
approaches for identifying direct or indirect interaction partners of disease-related 
proton*, which must be seen as new targets for drug development. The solution to 
th,s technical problem is achieved by providing the embodiments characterized in the 
claims. 

Accordingly, the present invention relates to a method for generating a network of 
direct and indirect interaction partners of a disease-related (poly)peptide comprising 
the steps of (a) contacting a selection of (poly)peptides suspected to contain one or 
several of said direct or indirect interaction partners with said disease-related 
(polypeptides and optionally with known direct or indirect interaction partners of said 
d,sease-related (poly)peptide under conditions that allow the interaction between 
interaction partners to occur; (b) detecting (polypeptides that interact with said 
d.sease-re.ated (polypeptide or with said known direct or indirect interaction partners 
of said disease-related (polypeptide*) contacting (polypeptides detected in step 
(b) with a selection of (polypeptides suspected to contain one or several 
(polypeptides interacting with said (polypeptides detected in step (b) under 
condiaons that allow the interaction between interaction partners to occur (d) 
detecting proteins that interact with said (polypeptides detected in step (b)' (•) 
contacting said disease-related (polypeptide and optionally said known direct or 
indirect interaction partners of said disease-related (polypeptide, said (polypeptides 
detected in steps (b) and (d) and a selection of preteins suspected to contain one or 
several (polypeptides interacting with any of the afore mentioned (polypeptides 
under conditions that allow the interaction between interaction partners to occur (0 
detecting (polypeptides that interact with said disease-related (polypeptide and 
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optionally said known direct or indirect interaction partners of said disease-related 
(poly)peptide or with said (poly)peptides identified in step (b) or (d); and (g) 
generating a (poly)peptide-(poly)peptide interaction network of said disease-related 
(poly)peptide and optionally said known direct or indirect interaction partners of said 
disease-related (poly)peptide and said (poly)peptides identified in steps (b), (d) and 
(f). 

In accordance with the present invention, the term "direct and indirect interaction 
partners" relates to (poly)peptides that either directly interact with the disease-related 
(poly)peptide (direct interaction) or that interact via a protein binding to/interacting 
with said disease-related (polypeptide. In the letter case, there is no direct contact 
between the direct interaction partner and the disease-related protein. Rather, a 
further protein forms a "bridge" between these two proteins. 

The term "known direct or indirect interaction partners" refers to the fact that for 
certain disease-related (poly)peptides, such interaction partners are known in the art. 
If such interaction partners are known in the art, it is advantageous to include them 
into the method of the invention. If no such interactions partners are known in the art, 
then the network may be generated starting solely from the known disease-related 
(poly)peptide. 

The term "conditions that allow the interaction between interaction partners to occur" 
relates to conditions that would, as a rule, resemble physiological conditions. 
Conditions that allow protein actions are well known in the art and, can be taken, for 
example from Golemis, E.A. Ed., Protein-Protein Interactions, Cold Spring Harbor 
Laboratory Press, 2002. 

The term "suspected to contain one or more of said direct or indirect interaction 
partners" relates to the fact that normally, a selection of (poly)peptides would be 
employed where the person skilled in the art would expect that interaction partners 
are present. Examples of such selections of (poly)peptides are libraries of human 
origin such as cDNA libraries or genomic libraries. 

The term "detecting proteins" refers to the fact that the (poly)peptides interacting with 
the "bait" (poly)peptides are identified within the selection of (poly)peptides. A further 
characterization or isolation of the "prey" (poly)peptides at this stage may be 



advantageous but is not necessary. The term "detecting (poly)peptides- preferabiy 
also comprises characterizing said (polypeptides or the nucleic acid molecules 
encoding said (poly)peptides. The skilled person knows that this can be done by a 
number of techniques, some of which are described for example in Sambrook et al 
Molecular Cloning, A Laboratory Manuel"; CSH Press, Cold Spring Harbor. 198 9 or 
Hrggrns and Hames (ads.). For example, the nucleotide sequence may be 
determined by DNA Sequencing, including PCR-Sequencing (see for example Mutlis 
K Faloona F, Scharf S, Saiki R, Horn G, Erlich H., Cold Spring Hart> Symp Quant 
8,0 1986:51 P, 1:263-73). Alternately, the amino ack. sequence of said 
(polypeptide may be determined. The skilled artesian knows various methods for 
sequencing proteins which include the method of Edman degradation, which is a 
prefened method of the present invention of determining the amino acid sequence of 
a protein However, the amino acid sequence of a protein or (polypeptide can also 
be relrably determined by methods such as for example Maldr-Tof, optionally in 
combination with the method of Edman dagradatJon. The interaction partner may be 
rdenMied either as fusion with a DNA binding domain or as fusion with an activation 
domarn. Preferably, if an interaction partner has been identified as a fusion molecule 
compnsrng a DNA binding domain, the interaction partner is cloned into a vector 
altowrng the expression of the interection partner as a fusion with an activation 
doman. Consequent protein interaction can be tested in the context the DNA 
actrvatron or the DNA binding domain. 

In accordance with the present invention, the fire, reund of detecting (polypeptides 

ZZTZ T ,he w (poly)pep,Wes in step (a) 

(polrtpephdes be considered as "prey (polypeptides is foilowed by the second 
round of detecting further interacting (polypeptides wherein the former "prey" 
(poWpeptides are now used as "bait" (polypeptides, in certain preferred 
embedments of .ha present invention such as in a two-hybrid detection system a re- 
donrng o, me former "prey (poiyjpeptides inte veCore ma, are suitable or 
expressing "bar (polypeptides may be desired. 

Accordingly, the invenflon describes a novel sbategy to identity p ro ,e ln .pre,ein 
rn teracbon networks for human ^ ^ ^ ^ P ^P*-n 

parr-wrse pro.ern-pre.ein interacts for Huntington's disease and is osefu, tor other 
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hereditary diseases as well. Several human hereditary diseases are summarized in 
table 5. 

A crucial step of the method of the invention is step (e). Here, the disease-related 
(poly)peptide and optionally said known direct or indirect interaction partners of said 
disease-related (poly)peptide are contacted under appropriate conditions, preferably 
at the same time, with both the (polypeptides identified in steps (b) and (d) and 
further with a selection of (poly)peptides suspected to contain further interaction 
partners. Alternatively, the various baits, preys and further selection partners are 
added one after another, so that the final pool contains all baits and preys so far 
identified, in addition to the further selection partners. In other terms, in this step of 
the method of the invention, all "baits" and all "preys" are pooled and, additionally, 
further potential interaction partners are added. In this way, surprisingly the number 
of directed or indirect interactions partners of the previously identified "baits" and 
"preys" could significantly be enhanced. It is to be understood that various preys 
identified in one detection step may interact with each other and not only with the 
baits that were employed for the identification. For example, if a collection of baits 
detects prays "a" and "b", the invention does not exclude that "a" also interacts with 
"b". The same holds true mutatis mutandis for the baits used in accordance with the 
present invention. 

It is further preferred in accordance with the present invention that the interaction of 
proteins is a specific interaction, such as a specific binding. This means that the 
(polypeptide being an interaction partner with a further (poly)peptide only or 
essentially only interacts with the interaction site(s) involved with this interaction 
partner. This does not exclude, of course, that further interaction sites of said 
(poly)peptide interact with further interaction partners, wherein in the corresponding 
interaction is preferably also specific. The concept also embraces that, if a 
(poly)peptide has several identical interaction sites, which in nature bind to different 
interaction partners, these different interaction partners are also bound by the 
(poly)peptide in the method of the present invention. 

In other terms, at least in the case of huntingtin, the number of interaction partners 
found in step (e) was enhanced in an exponential rather than in a linear fashion. 
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The term "(polypeptide" re fers alternatively to peptide or ,o (polypeptides. Peptides 
conventionally are covalentty linked amino acids of up to 30 residues, whereas 
polypeptides (also referred to as "plains") comprise 31 and more amino acid 
residues. 

The term "huntingtin" refers to a protein with the data bank accession number 
P42858 whtch is referenced for the purpose of the present invention as "wild-type 
hunting,* protein". However, the term "huntingtin" also comprises proteins encoded 
by the nucleic acid sequence deposited under accession number L12392 or to 
proteins encoded by nucleic acid molecules which hybridize to the nucleic acid 
molecule of L12392 under stringent conditions of hybridization. The present invention 
relates to all variants of .he huntingtin protein. In particular, retevant for the present 
inventor, are those variants of huntingtin which comprise a polyglutamine tract 
(polyQ tract) or an elongated poiyQ tract. A polyQ tract consists of two or more 
giutamines within the huntingtin protein. The insertion of additional glutamine codons 
wtil result in huntingtin proteins with, for example 2. 51, 75 or 100 added giutamines 
in comparison to the sequence deposited under accession number P42858 in fact 
the person skilled In the art knows the, the huntingtin protein may have a giutamine 
tract with any random number of giutamines in the range of 1 to 200 added 
giutamines. All these proteins are comprised by the present invention. 

The term "hybridizes under stringent conditions", as used in the description of the 
present invention, is well known to the skilled artisian and corresponds to conditions 
of h,gh stnngency. Appropriate stringent hybridization conditions for each sequence 
may be established by a person skilled in the art on well-known parameters such as 
temperature, composition of the nucleic acid moiecules, salt conditions etc, see for 
exampte, Sambrook e. a,., "Moiecuiar Cloning, A Laboratory Manual"; CSH Press, 
Cold Spnng Harbor, 1989 or Higgins and Hamas (eds.). "Nucleic acid hybridization a 

.""T" Pr6SS ' ^ 1985> 886 h PartiCUlar ,he 
Hybndtzation Strategy" by Britten & Davidson, 3 to 15. Stringent hybridization 

coitions are, for example, conditions comprising overnight incubation at 42" C in a 

solution comprising: 50% formamide. 5x SSC (750 mM NaCI. 75 mM trisodium 

ctrate). 50 mM sodium phosphate ( P H 7.6). 5x Denhardfs solution. 10% daxtran 

sulfate, and 20 micrograms/ml denatured, sheared salmon spam, DNA. followed by 
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washing the filters in 0.1 x SSC at about 65°. Other stringent hybridization conditions 
are for example 0.2 x SSC (0.03 M NaCI, 0.003M Natriumcitrat, pH 7) bei 65°C. In 
addition, to achieve even lower stringency, washes performed following stringent 
hybridization can be done at higher salt concentrations (e.g. 5X SSC). Note that 
variations in the above conditions may be accomplished through the inclusion and/or 
substitution of alternate blocking reagents used to suppress background in 
hybridization experiments. Typical blocking reagents include Denhardfs reagent, 
BLOTTO, heparin, denatured salmon sperm DNA, and commercially available 
proprietary formulations. The inclusion of specific blocking reagents may require 
modification of the hybridization conditions described above, due to problems with 
compatibility. 

The skilled person knows that the presence of additional codons in the nucleic acid 
sequence of huntingtin might significantly reduce the capability of this nucleic acid 
molecule to hybridize to the nucleic acid molecule deposited under L12392 and 
referenced as wild-type huntingtin protein. Nevertheless, such proteins shall still be 
comprised by the present invention. In fact, computer programs such as the 
computer program Bestfit (Wisconsin Sequence Analysis Package, Version 8 for 
Unix, Genetics Computer Group, University Research Park, 575 Science Drive, 
Madison, Wl 53711) or blast, capable of calculating homologies between two nucleic 
acid sequences, efficiently recognize nucleotide insertions and allow for an 
adjustment of gaps created by these insertions. The term "huntingtin" as used in the 
present invention, also includes those molecules of huntingtin, which have a 
homology of more than 95% to wild-type huntingtin when analyzed with a program 
like bestfit under conditions not weighing gaps created by polyQ tracts (gap 
penalty=0). 

The term "contacting" means bringing into contact so that two or more proteins or 
(poly)peptides can interact with each other, preferably under physiological conditions. 
The terms "interacting" or "binding" refer to a transient or permanent contact between 
two proteins or (polypeptides. Preferably, the (poly)peptide or protein is provided by 
expression from a nucleic acid molecule, more preferably from a cDNA molecule 
within a cDNA library. Alternatively, said nucleic acid molecule is a genomic nucleic 
acid molecule of a genomic DNA library, or a nucleic acid molecule from a synthetic 
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DNA or RNA library. Preferably, the nucleic acid molecule encoding the disease- 
related protein or its interaction partner is obtainable from nerve cells, brain tissue 
human adrenal gland, human bladder, human bone, human brain, human colon 
human dorsal root ganglion, human heart, human HeLa cells, human kidney, human 
liver, human lung, human mammary gland, human ovary, human pancreas, human 
placenta, human prostate, human retina, human salivary gland, human sceletal 
muscle, human small intestine, human smooth muscle, human spinal cord, human 
spleen, human stomach, human testis, human thymus, human thyroid, human tonsil 
human trachea, human uterus, human cell line HEP G2, human cell line MDA 435 
human fetal brain, human fetal heart, human fetal kidney, human fetal liver, human 
fetal spleen, human fetal thymus, human breast tumor, human cervix tumor, human 
colon tumor, human kidney tumor, human lung tumor, human ovary tumor, human 
stomach tumor, human brain tumor and/or human uterus tumor. 

The term "disease-related protein" refers to a protein known to be the causative 
agent of a disease or known to be involved in onset or progression of a disease 
Preferably, said disease is CHOREA HUNTINGTON or the disease-related protein is 
huntingtin. More preferably, the disease-related protein is selected from table 1 
and/or 2. The term "conditions that allow the interaction between interaction partners- 
means conditions that are similar to physiological conditions. Preferably, said 
conditions are physiological conditions. 

The term "selection of (polypeptides" refers to a library of (polypeptides which 
composes the above-mentioned libraries, but also includes libraries such as phage 
d,s P lay libraries. Preferably, the (poly)peptide is provided by expression from a 
nucle,c acid molecule. Preferably, the protein or (polypeptide expressed by said 
nucle.c acid molecule is a (poly)peptide comprising a DNA binding domain (DBD) (in 
th,s case the fusion protein is termed "bait") or (b) a (polypeptide comprising an 
act.vat.on domain capable of interacting with a transcription factor or an RNA 
polymerase and capable of activating transcription of a reporter or indicator gene (in 
th.s case the fusion protein is called "prey"). As used here, the terms "reporter gene" 
and "indicator gene" are to be understood as synonyms. It is important to note that 
one of the interaction partners will always comprise the amino acid sequence of a 
protein or (polypeptide translated from said nucleic acid molecule while the other 



interaction partner will comprise the amino acid sequence of a protein or protein 
fragment. Preferably, a bait used for a method of the present invention is selected 
from the proteins listed in table 1 and/or 2. If, for example, the proteins encoded by 
the nucleic acid molecules contain a DNA binding domain fused in frame, the fusion 
protein can bind to the DNA recognition sequence of the DNA binding domain. 
Interaction of said fusion protein with a second fusion protein containing an activation 
domain can induce transcription of a nearby indicator gene. The indicator gene may 
encode a selection marker such as a protein that confers resistance to an antibiotic 
including ampicillin, kanamycin, chloramphenicol, tetracyclin, hygromycin, neomycin 
or methotrexate. Further examples of antibiotics are Penicillins: Ampicillin HCI, 
Ampicillin Na, Amoxycillin Na, Carbenicillin disodium, Penicillin G, Cephalosporins, 
Cefotaxim Na, Cefalexin HCI, Vancomycin, Cycloserine. Other examples include 
Bacteriostatic Inhibitors such as: Chloramphenicol, Erythromycin, Lincomycin, 
Tetracyclin, Spectinomycin sulfate, Clindamycin HCI, Chlortetracycline HCI. 
Additional examples are proteins that allow selection with Bacteriosidal inhibitors 
such as those affecting protein synthesis irreversibly causing cell death. 
Aminoglycosides can be inactivated by enzymes such as NPT II which 
phosphorylates 3'-OH present on kanamycin, thus inactivating this antibiotic. Some 
aminoglycoside modifying enzymes acetylate the compound and block their entry in 
to the cell. Gentamycin, Hygromycin B, Kanamycin, Neomycin, Streptomycin, G418, 
Tobramycin Nucleic Acid Metabolism Inhibitors, Rifampicin, Mitomycin C, Nalidixic 
acid, Doxorubicin HCI, 5-Flurouracil, 6-Mercaptopurine, Antimetabolites, Miconazole. 
Trimethoprim, Methotrexate, Metronidazole, Sulfametoxazole. Alternatively, said 
indicator gene may encode a protein such as lacZ, GFP or luciferase, the expression 
of which can be monitored by detection of a specific color. Other proteins commonly 
used as indicator proteins are beta-galactosidase, beta-glucuronidase, green 
fluorescent protein (GFP), autofluorescent proteins, including blue fluorescent protein 
(BFP), glutathione-S-transferase (GST), luciferase, horseradish peroxidase (HRP), 
and chloramphenicol acetyltransferase (CAT). In general, however, the selection in 
the yeast two hybrid-system is based on a deficiency of the yeast strain to produce 
specific amino acids. The skilled person knows that any amino acid deficiency can be 
used for this selection strategy. 



Preferably said preys and baits are expressed from two separata expression vectors 
contained in one host cell. The nudaic acid moiecule encoding the preys and baits 
can be introduced into the host cell, for exampte, by transformation, transfection 
transduction or microinjection which are common techniques known to the person 
stalled m the art and which require no additional explanation. In addition, the nucleic 
aod molecule contains a chromosomal or episomal nucleic acid sequence encoding 
the above-mentioned indicator protein. The expression of said indicator protein is 
under control of a recognition sequence which serves as a binding site for the bait 
prolan. The nucleic add molecule may be fused either to a DNA binding domain or 
to an activation domain. Co-expression of only those bait- and prey fusion proteins 
wh,ch are capable of interacting will Induce the expresdon of one of the above- 
■dentffled indicator proteins end thus allow the identification a nucleic add molecule 
encoding a protein capable of interacting with huntingtin or an interaction or binding 
partner of huntingtin. The skilted person knows this system as the yeest two hybrid 
system. The yeast too hybrid system, which uses a baft protein-prey pr0 ,ein 
»mb,nation to induce transcription of toe reporter gene, is a preferred method to 
■dentrfy proteins capable of interacting with huntingtin or with a direct or indirect 
interaction or binding partner of huntingtin. See for example Fields and Song Nature 
340*45 (1989) or Uetz et a.., 2000 Nature 403(6770): 623-7. This is a useful way of 
determining protein-protein interactions. Another preferred method uses the yeast 
hree hybrid system, as described in U.S. Pat. No. 5,928,868. Preferably, steps (a) to 
(d) of the method for generating a network of direct and indirect interaction partners 
comprise the yeas, two hybrid system. Preferably, steps (e) and (f) of toe method for 
generate a network of direct and indirect interaction partners comprise yeas, 
■n teracton mating. Preferably, said "interaction mating- comprises the interaction of 
a rnteraction partners identified in steps (a) to (d). Also preferred is that the 
interaction partners identified in steps (a, to (d) interact as prey and bait proteins, so 
tha all prey proteins are contacted with all bait proteins. Using the array mating 
system each bait is tested individually for interact with every prey in the array 
Afiemafively. steps (e) and ( 0 of the method tor generating a networit of direct and 
,nd,rect .interaction partners comprise testing all interaction partnere identified in steps 
(a) to (d) ,n rnteraction assays such as biaoore or coimmunoprecipitation. When 
performs such an assay, i, is preferred that the interaction partners are tested as 
prey and/or bait fusion proteins or contain no fused (polypeptides. Preferably all 



interaction partners are contacted in the biacore or coimmunoprecipitation assay by 
themselves and by all other remaining interaction partners identified in steps (a) to 
(d). 

The method for generating a network of direct and indirect interaction partners of a 
disease related protein or (poly)peptide has proven to be an effective tool for 
unveiling the protein-protein interactions (PPI) of preferably monogenic diseases. 
This is exemplified by the analysis of the disease related protein of Chorea 
Huntington, the analysis of which has demonstrated that the method of the present 
invention will be useful in an approach to identify potential drugs in the treatment of 
CHOREA HUNTINGTON. Moreover, this method will also be effective in unveiling 
the protein-protein interactions of other disease related proteins and in identifying 
novel targets for treatment of these diseases. Using a preferred combination of 
library and matrix yeast two-hybrid screens, based on the methods of the present 
invention, a highly connected network was generated among 70 proteins involved in 
117 protein-protein interactions, 99 of which had not been described previously. As 
progression of Huntington's disease (HD) appears to be linked to huntingtin 
aggregation, a set of network proteins was tested for their potential to modulate this 
process. By using the methods of the present invention, it was discovered that the 
GTPase activating protein GIT1 strongly promotes huntingtin aggregation in vivo. 
GIT1 also localises to huntingtin aggregates in brains of transgenic mice and HD 
patients. Therefore, a combination of the methods of the present invention has 
proven to provide effective means for the identification of potential targets for 
therapeutic intervention. GIT1 is a selected example of a modulator interaction 
partner of huntingtin. The other proteins in the network of interaction partners 
disclosed by the present invention are further modulator interaction partners of 
huntingtin. 

Preferably, the interaction mating comprises using an array maiting system. In 
general, for this screen, MATa yeast cultures are transformed with plasmids 
encoding prey proteins and arrayed on a microtiter plate for interaction mating with 
individual MATa strains expressing bait proteins. Using this test system, each bait 
can be tested individually for interaction with every prey in the array. Diploid yeast 
clones, formed by maiting on YPD plates and expressing both, bait and prey 
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proteins, are selected on agar SDH plates, and further transferred for example by a 
spotting robot on SDIV plates to seteot for protein-protein interactions. In a more 
preferred embodiment of the method, plasmids encoding bait and prey pmteins are 
transformed into strains L40ccua and L40cco, respectively. L40cca clones are 
arrayed on microtitre plates and mixed with a single L40ccua clone for interaction 
ma„ng. These cells are transferred, preferably by a robot onto YPD medium plates 
and, after incubation for 20h to 28h at approximately 30-C. for selection of the cells 
were transferred onto SDI. medium plates, where mating taxes place, tor additional 
60h to 80h at approximately 30'C. For two-hybrid selection diploid cells are 
transferred onto SDIV medium piates with and without nylon or nitrocellulose 
membranes and incubated for approximately 5 days at about 30'C. The nylon or 
nttrocellulose membranes are subjected to the B-GAL assay. Positive clones can be 
vermed by ^transformation assays using plasmids encoding respective bait and 
prey plains. Other preferred methods for studying proteln-protein interactions 
according to the present invention are colocalizatton, coimmunoprecipitation 
screens of protein or (polypeptide arrays, library screens, in vivo and in vitro 
binding experiments using different tags such as HIS6, TAP or FLAG. 

In a preferred embodiment of the present invention's method for generating a 
network of direct and indirect interaction partners of a disease related pretein or 
polypeptide, plasmids encoding bait proteins are transformed into a strain such as 
UOccua. tested for the absence of reporter gene acMy and cc-transfotmed with a 
human feja, brain cDNA libra*. Independent transferees are plated onto minima, 
medium lackmg tryptophan, leucine, histidine and uracil (SDIV medium) and 
■ncubated a. about 30-C for 5 to 10 days. Cones are trensferred lnto mteroti tre 
Pfc.es, optionally using a picking robot, and grown over nigh, in liquid minima. 
med,um lackmg tryptophan and leucine (SDH medium). Subsequently, the clones are 
spotted onto nylon or nitrocellulose membranes ptaced on SDIV medium plates After 
mcubation for about 4 days membranes are subjected to a B^alactosidase (B-GAL) 
assay. Plasmids are prepared from positive clones and characterised, for exampte by 
r^tncfon analyses and sequencing. For retransfcxmation assays plasmids encoding 
baft and prey proteins are cotransfonned in the yeas, strain L40ccua and plated onto 
SDIV medium. 



The term "generating a protein-protein interaction (PPI) network" means listing the 
interactions of all proteins interacting or binding directly or indirectly interacting the 
disease related (poly)peptide or protein. Preferably, this can be done by displaying 
the information in a matrix or a network representation. In a more preferred 
embodiment of the present invention's method, the protein-protein interaction 
network is generated by using Pivot 1.0 (Prof. Ron Shamir, Prof. Yossi Shilo, Nir 
Orlev; Tel Aviv University (TAU); Dep. of computer science; Ramat Aviv; Tel Aviv 
69978; Israel). 

In a preferred embodiment of the invention, interactions are detected by using the 
yeast two-hybrid system, MALDI-TOF MS or electro spray MS. Preferably, yeast 
strains such as strains L40ccua and L40cca, are transformed with an expression 
selected from the group consisting of pBTM116, pBTM117, pBTM117c, pACT2, 
pAS2-1, pGADIO, pGAD424, pGAD425, pGAD426, pGAD427, pGAD428. 

In another preferred embodiment of the present invention's method for generating a 
network of direct and indirect interaction partners of a disease-related polypeptide, 
the method contains after step (d) the additional steps of isolating a nucleic acid 
molecule with homology to said nucleic acid molecule expressing the encoded 
protein and testing it for its activity as a modulator of huntingtin, wherein said nucleic 
acid molecule is DNA, RNA, cDNA, or genomic DNA. Said testing can be done in 
several different assays. Preferably, the testing is performed in a co- 
immunoprecipitation assay or an affinity chromatography-based technique. 
Generally, co-immunoprecipitation is performed by purifying an interacting protein 
complex with a single antibody specific for one protein in the protein complex and by 
detecting the proteins in the protein complex. The step of detection can involve the 
use of additional antibodies directed against proteins suspected of being trapped in 
the purified protein complex. Alternatively, at least one protein in the protein complex 
is fused to a tag sequence with affinity to a compound fixed to a solid matrix. By 
contacting the solid matrix with said tagged protein, further proteins binding to said 
protein can be purified and binding can be detected. GST or HA are preferred tags in 
accordance with the present invention. 

In a preferred embodiment of the present invention's method, said contacting step (e) 
is effected in an interaction mating two hybrid approach. 



In another preferred embodiment of the present invention's method, said method 
comprises after step (d) and before step (e) the steps of: <<f) contacting 
(polypeptides detected in step (d) with a selection of (poly)peptides suspected to 
contain one or several (poly)peptides interacting with said (poly)peptides detected in 
step (d) under conditions that allow the interaction between interaction partners to 
occur; and (d") detecting proteins that interact with said (poly)peptides detected in 
step (d'). 

This preferred embodiment of the invention, an additional step of identifying further 
interaction partners is carried out prior to the contacting of all "baits" and "preys" in 
one pool (step (e)). Optionally, further steps of selecting interaction partners in 
analogy to steps (d') and (d») may be infected prior to the pooling/interaction step. 

Diseases of particular interest for which interrelationships of disease-related proteins 
may be analyzed in accordance with the invention are provided in Table 5. 

In yet another preferred embodiment of the present invention's method, said disease 
related protein is a protein suspected of being a causative agent of a hereditary (sea 
Table 5), such as a monogenic disease. 

In another preferred embodiment of the present invention's method, said disease 
related protein is huntingtin and said interaction partners are the interaction partners 
as shown in table 1 and/or table 2. 

In another preferred embodiment of the present invention's method, said method 
comprises the step of determining the nucleotide sequence of a nucleic acid 
molecule encoding a direct or indirect interaction partner of the disease related 
protein. 

In another preferred embodiment of the present invention's method, said selections 
of proteins are translated from a nucleic acid library. 

In another preferred embodiment of the present invention's method, said selection of 
proteins in step (a) and/or (c) and/or (d') and/or (e) is the same selection or a 
selection from the same source. In another preferred embodiment of the present 



invention's method, said selection of proteins in step (a) and/or (c) and/or (d') and/or 
(e) is a different selection or a selection from a different source. 

Preferably, said source is selected from nerve cells, brain tissue, human adrenal 
gland, human bladder, human bone, human brain, human colon, human dorsal root 
ganglion, human heart, human HeLa cells, human kidney, human liver, human lung, 
human mammary gland, human ovary, human pancreas, human placenta, human 
prostate, human retina, human salivary gland, human sceletal muscle, human small 
intestine, human smooth muscle, human spinal cord, human spleen, human 
stomach, human testis, human thymus, human thyroid, human tonsil, human trachea, 
human uterus, human cell line HEP G2, human cell line MDA 435, human fetal brain, 
human fetal heart, human fetal kidney, human fetal liver, human fetal spleen, human 
fetal thymus, human breast tumor, human cervix tumor, human colon tumor, human 
kidney tumor, human lung tumor, human ovary tumor, human stomach tumor, human 
brain tumor and/or human uterus tumor. 

In another preferred embodiment of the present invention's method, said method is 
performed by contacting the proteins on an array. Preferably, said array is an array 
allowing to detect protein-protein interaction by the principle of a biacore detector. 

In another preferred embodiment of the present invention's method, said interactions 
are detected by using the yeast two-hybrid system. Preferably, said inteactions 
detected by using MALDI-TOF, MS, electro spray MS or biacore. 

In another preferred embodiment of the present invention's method, said method 
contains after step of (b), (d), (d") or (f) the additional steps of isolating a nucleic acid 
molecule with homology to said cDNA expressing the encoded protein and testing it 
for its activity as a modulator of huntingtin, wherein said nucleic acid molecule is 
DNA, or RNA, and preferably cDNA, or genomic or synthetic DNA, or mRNA. 

The present invention also relates to a nucleic acid molecule encoding a modulator of 
huntingtin, wherein said modulator is a protein selected from table 3. Figure 6 
provides the amino acid sequences of the new proteins or (poly)peptides listed in 
table 3. The term "modulator protein of huntingtin" comprises two types of proteins 
within the network of proteins interacting with huntingtin. Direct interaction or binding 
partners of huntingtin are those proteins in the PPI network of huntingtin that directly 



interact with or bind to huntingtin (see figure 2). Examples of these proteins are IKAP 
HYPA, CA150, HIP1, HIP11, HIP13, HIP15, CGM25, PFN2, HP28, DRP-l' 
SH3GL3, HZFH, HIPS, PIASy, HIP16. GIT1, Ku70 and FEZ1. Table 2 and figure 6 
provides a reference allowing to identify these proteins. The second class of proteins 
are indirect interaction or binding partners of huntingtin. i.e. those proteins in the PPI 
network of huntingtin that do not directly interact with or bind to huntingtin. Such 
proteins require a mediator, i.e. a direct binding partner of huntingtin to exert their 
huntingtin modulating function. Examples of these proteins are BARD1 or VIM, which 
bind to direct interaction partners of huntingtin. However, complexes of huntingtin 
and a direct interaction or binding partner are likely to interact with additional indirect 
interaction or binding partners. To summarize the above, modulator proteins of 
huntingtin can exert their function by direct or indirect contact to huntingtin. 

The term "modulator protein", as used in the present invention, refers to a protein 
capable of modulating the function or physical state of a second protein and 
comprises proteins that enhance or reduce (inhibit) the function or activity of 
huntingtin. Preferably, the modulator protein is a protein having an activity selected 
from the group consisting of oxidoreductase activity (acting on the CH-OH group of 
donors, acting on the aldehyde or oxo group of donors, acting on the CH-CH group of 
donors, acting on the CH-NH(2) group of donors, acting on the CH-NH group of 
donors, acting on NADH or NADPH. acting on other nitrogenous compounds as 
donors, acting on a sulfur group of donors, acting on a heme group of donors, acting 
on diphenols and related substances as donors, acting on a peroxide as acceptor, 
acting on hydrogen as donor, acting on single donors with incorporation of molecular 
oxygen, acting on the CH-OH group of donors, acting on superoxide as acceptor 
oxidizing metal ions, acting on -CH(2) groups, acting on iron-sulfur proteins as 
donors, acting on reduced flavodoxin as donor, acting on phosphorus or arsenic in 
donors, acting on x-H and y-H to form an x-y bond, other oxidoreductases) 
transferase activity (transferring one-carbon groups, transferring aldehyde or ketone 
residues, acyltransferases. glycosyltransferases, transferring alkyl or aryl groups 
other than methyl groups, transferring nitrogenous groups, transferring phosphorous- 
containing groups, transferring sulfur-containing groups, transferring selenium- 
containing groups), hydrolase activity (glycosylase activity, acting on ether bonds 
acting on peptide bonds, acting on carbon-nitrogen bonds (other than peptide 
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bonds), acting on acid anhydrides, acting on carbon-carbon bonds, acting on halide 
bonds, acting on phosphorus-nitrogen bonds, acting on sulfur-nitrogen bonds, acting 
on carbon-phosphorus bonds, acting on sulfur-nitrogen bonds, acting on carbon- 
phosphorus bonds, acting on sulfur-sulfur bonds, acting on carbon-sulfur bonds, 
lyases (carbon-carbon lyases, carbon-oxygen lyases, carbon-nitrogen lyases, 
carbon-sulfur lyases, carbon-halide lyases, phosphorus-oxygen lyases, other lyases), 
isomerases (racemases and epimerases, cis-trans-isomerases, intramolecular 
oxidoreductases, intramolecular transferases, intramolecular lyases, other 
isomerases), ligases activity (forming carbon-oxygen bonds, forming carbon-sulfur 
bonds, forming carbon-nitrogen bonds, forming carbon-carbon bonds, forming 
phosphoric ester bonds), transcription factor activity, filament protein, membrane 
protein and structural protein. 

In a preferred embodiment, the present invention's nucleic acid molecule is DNA, or 
RNA, and preferably cDNA, or genomic DNA or synthetic DNA or mRNA 

In another preferred embodiment of the invention, the nucleic acid molecule is double 
stranded or single stranded. 

In another preferred embodiment of the invention, the nucleic acid molecule is of 
vertebrate, nematode, insect, bakterium or yeast. Preferably, the nematode is 
Caenorhabditis elegans. In another more preferred embodiment of the present 
invention, the insect is drosophila, preferably drosiphila melanogaster. In another 
more preferred embodiment of the present invention, the vertebrate is human, mouse 
rat, Xenopus laevis, zebrafish. 

In yet another preferred embodiment of the present invention, the nucleic acid 
molecule is fused to a heterologous nucleic acid molecule. In a further preferred 
embodiment of the present invention, the heterologous (poly)peptide encoded by 
said heterlogous nucleic acid molecule is an immunoglobulin Fc domain. 

In another preferred embodiment of the present invention the nucleic acid molecule is 
labeled. Labeled nucleic acid molecules may be useful for purification or detection. 
Suitable labels include fluorochromes, e.g. fluorescein isothiocyanate (FITC), 
rhodamine, Texas Red, phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6- 



FAM), 2',7'-dimethoxy-4 1 ,5'-dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X- 
rhodamine(ROX), 6-carboxy-2\4\7\4,7-hexachlorofluorescein (HEX) 5. 
carboxyfluorescein (5-FAM) or N.N.N'.N'-tetramethyl-e-carboxyrhodamine (TAMRA) 
radioactive labels, e.g. «s, 3 H; etc . Tne |abe| mgy a|SQ be g ^ stage 

where the DNA is conjugated to biotin, haptens, etc. having a high affinity binding 
partner, e.g. avidin, specific antibodies, etc., where the binding partner is conjugated 
to a detectable label. In the case of amplification the label may be conjugated to one 
or both of the primers. The pool of nucleotides used in the amplification may also be 
labeled, so as to incorporate the label into the amplification product. Alternatively the 
double strand formed after hybridization can be detected by anti-double strand DNA 
specific antibodies or aptamers etc. 

In a more preferred embodiment said heterologous nucleic acid molecule encodes a 
heterologous polypeptide. Preferably said heterologous (poly)peptide, fused to the 
(polypeptide encoded by the nucleic acid molecule of the present invention, is a 
DNA binding protein selected from the group consisting of GAL4 (DBP) and LexA 
(D8P). Also preferred in accordance with the present invention are activation 
domains selected from the group consisting of GAL4(AD) and VP16(AD) Also 
preferred are (poly)peptides selected from the group consisting of GST, His Tag 
Flag Tag, Tap Tag, HA Tag and Protein A Tag. 

Thus, the sequence encoding the (poly)peptide may be fused to a marker sequence 
such as a sequence encoding a peptide which facilitates purification of the fused 
(polypeptide. In certain preferred embodiments of this aspect of the invention the 
marker amino acid sequence is a hexa-histidine peptide, such as the tag provided in 
a PQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311) among 
others, many of which are commercially available. As described in Gentz et al. Proc 
Natl. Acad. Sci. USA 86:821-824 (1989). for instance, hexa-histidine provides for 
convenient purification of the fusion protein. The »HA» tag is another peptide useful 
for purification which corresponds to an epitope derived from the influenza 
hemagglutinin protein, which has been described by Wilson et al Cell 37- 767 
(1984). 

The (poly)peptide may be expressed in a modified form, such as a fusion protein 
and may include not only secretion signals, but also additional heterologous 



functional regions. For instance, a region of additional amino acids, particularly 
charged amino acids, may be added to the N-terminus of the (poly)peptide to 
improve stability and persistence in the host cell, during purification, or during 
subsequent handling and storage. Also, peptide moieties may be added to the 
(poly)peptide to facilitate purification. Such regions may be removed prior to final 
preparation of the (poly)peptide. The addition of peptide moieties to (poly)peptides to 
engender secretion or excretion, to improve stability and to facilitate purification, 
among others, are familiar and routine techniques in the art. A preferred fusion 
protein comprises a heterologous region from immunoglobulin that is useful to 
stabilize and purify proteins. 

The present invention also relates to a method of producing a vector comprising the 
nucleic acid molecule the present invention. Furthermore, the present invention 
relates to a vector produced said method. 

The present invention also relates to a vector comprising the nucleic acid molecule of 
the present invention. Preferably said vector is a transfer or expression vector 
selected from the group consisting of pACT2; pAS2-1; pBTM116; pBTM117; 
pcDNA3.1; pcDNAI; pECFP; pECFP-C1; pECFP-N1; pECFP-N2; pECFP-N3; 
pEYFP-CI; pFLAG-CMV-5 a, b, c; pGADIO; pGAD424; pGAD425; pGAD427; 
pGAD428; pGBT9; pGEX-3X1; pGEX-5X1; pGEX-6P1; pGFP; pQE30; pQE30N; 
PQE30-NST; pQE31; pQE31N; pQE32; pQE32N; pQE60; pSE111; pSG5; pTET- 
CMV-AS; pTET-CMV-F°-AS; pTET-CMV-F°-S; pTET-CMV-MCS; pTET-CMV-S; pTK- 
Hyg; pTL1; pTL10; pTL-HAO; pTL-HA1; pTL-HA2; pTL-HA3; pBTM118c; pGEX-6P3; 
pACGHLT-C; pACGHLT-A; pACGHLT-B; pUP; pcDNA3.1-V5His; pMalc2x. Said 
expression vectors may particularly be plasmids, cosmids, viruses or bacteriophages 
used conventionally in genetic engineering plasmids, cosmids, viruses and 
bacteriophages used conventionally in genetic engineering that comprise the 
aforementioned nucleic acid. Preferably, said vector is a gene transfer or targeting 
vector. Expression vectors derived from viruses such as retroviruses, vaccinia virus, 
adeno-associated virus, herpes viruses, or bovine papilloma virus, may be used for 
delivery of the nucleic acid into targeted cell population. Methods which are well 
known to those skilled in the art can be used to construct recombinant viral vectors; 
see, for example, the techniques described in Sambrook et al., Molecular Cloning A 
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Interscience. N.Y. (1989). " 9 ^ 3SOC es and Wiley 

•n ye. a further preferred embodiment of the invent the vector contains an 

stabitizadon of the Lnscl and^ 9 ' emina,i0n " ' ranSCrtp,i0 ' , and 

regions. Possible regulatory elements oZiH heterologous promoter 

are the AOX1 or Gam P 9 expresslon in eukaryoHc host cells 

re me AUX1 or GAL1 promoter in yeast or the CMV- SV4n bow 

sarcoma virus), CMV-enhance, SV40-enhancer or m aZ l < Rous 
other animal cells Besid. J 7 9 " ' ntron ln mamm ^lian and 

Opbonally, me heterologous sequence can encode a fusion protein 



2 2 



including an C- or N-terminal identification peptide imparting desired characteristics, 
e.g., stabilization or simplified purification of expressed recombinant product. In this 
context, suitable expression vectors are known in the art such as Okayama-Berg 
cDNA expression vector pcDVI (Pharmacia), pCDM8, pRc/CMV, pcDNAI, pcDNA3, 
the EchoTM cloning System (Invitrogen), pSPORTI (GIBCO BRL) or pRevTet- 
On/pRevTet-Off or pCI (Promega). 

The present invention also relates to a method of producing a host cell comprising 
genetically engineering cells with the nucleic acid molecule or the vector of the 
present invention. The present invention also relates to a host cell produced said 
method. Furthermore, the present invention relates to a host cell comprising the 
vector of the present invention. Preferably, said host cell contains an endogenous 
nucleic acid molecule which is operably associated with a heterologous regulatory 
control sequence, including the regulatory elements contained in the vector of the 
present invention. 

The present invention also relates to a method of producing a (polypeptide, 
comprising culturing the host cell of the present invention under conditions such that 
the (polypeptide encoded by said polynucleotide is expressed and recovering said 
(poly)peptide. 

The present invention also relates to a (polypeptide comprising an amino acid 
sequence encoded by a nucleic acid molecule the present invention, or which is 
chemically synthesized, or is obtainable from the host cell of the present invention, or 
which is obtainable by a method the present invention. 

In another preferred embodiment of the invention, the (poly)peptide or protein is of 
vertebrate, nematode, insect, bakterium or yeast. Preferably, the nematode is 
Caenorhabditis elegans. In another more preferred embodiment of the present 
invention, the insect is drosophila, preferably drosiphila melanogaster. In another 
more preferred embodiment of the present invention, the vertebrate is human, mouse 
rat, Xenopus laevis, zebrafish. 

In another preferred embodiment, the (poly)peptide of the present invention is fused 
to a heterologous (polypeptide. Such a fusion protein may include not only secretion 
signals, but also additional heterologous functional regions. For instance, a region of 



additional amino acids, particularly charged amino acids, may be added to the 
N-terminus of the (polypeptide to improve stability and persistence in the host cell 
during purification, or during subsequent handling and storage. Also, peptide 
moiet.es may be added to the (polypeptide to facilitate purification. Such regions 
may be removed prior to final preparation of the (polypeptide. The addition of 
peptide moieties to (polypeptides to engender secretion or excretion, to improve 
stabHrty and to facilitate purification, among others, are familiar and routine 
techn.ques in the art. A preferred fusion protein comprises a heterologous region 
from immunoglobulin that is useful to stabilize and purify proteins. 

in a preferred embodiment of the present invention, the (polypeptide of the present 
-nvention is fused to a heterologous (polypeptide which is an immunoglobulin Fc 
domain or Protein A domain. In another preferred embodiment of the present 
.nvention. the (polypeptide the (polypeptide is labelled. Preferably, the label is 
selected from the group consisting of fluorochromes, e.g. fluorescein isothiocyanate 
(FITC), rhodamine, Texas Red, phycoerythrin, allophycocyanin, 6-carboxyfluorescein 
(6-FAM), 2',7'-dimethoxy-4',5'-dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X 
rhodamine(ROX), 6-carboxy-2',4',7'.4,7-hexachlorofluorescein (HEX) 5 
carboxyfluorescein (5-FAM) or N,N,N',N'-tetrameth y l-6-carboxyrhodamine (TAMRA) 
radioactive labels, e.g. " P , 3s s> a H . etc . The labeI may gIso be g ^ stage system 

where the protein or (polypeptide is conjugated to biotin, haptens, etc. having a high 
affinity binding partner, e.g. avidin, specific antibodies, etc., where the binding partner 
is conjugated to a detectable label. In another preferred embodiment of the present 
invention the label is a toxin, radioisotope, or fluorescent label. 

In another preferred embodiment of the present invention, the (polypeptide contains 
or lacks an N-terminal methionine, it is well known in the art that the N-terminal 
methionine encoded by the translation initiation codon generally is removed with high 
efficency from any protein after translation in all eukaryotic cells. While the 
N-terminal methionine on most proteins also is efficiently removed in most 
prokaryotes, for some proteins this prokaryotic removal process is inefficient 
depend.ng on the nature of the amino acid to which the N-terminal methionine is 
covaiently linked. 



The present invention also relates to a protein complex comprising at least two 
proteins, wherein the two proteins are selected from the group of interaction partners 
listed in table 4. The term "protein complex" refers to a compound stably comprising 
at least two proteins wherein said stability allows to purify said protein complex. 

In a preferred embodiment of the present invention, the protein complex comprises 
GIT1 and huntingtin. 

The present invention also relates to an antibody specifically recognizing the 
(poly)peptide of the present invention or specifically reacting with the protein complex 
of the present invention. This antibody is characterized in not recognizing the 
individual components of the protein complex but rather the complex itself. As such, 
said antibody recognizes a combined epitope, composed of amino acids of two 
different proteins within the protein complex. Dissociation of the complex will be 
detrimental to antibody recognition. Therefore, antibody binding depends on the 
integrity of the protein complex. In a preferred embodiment of the present invention, 
the antibody is specific for a protein complex comprising GIT1 and huntingtin. 

In a preferred embodiment, the antibody of the present invention is polyclonal, 
monoclonal, chimeric, single chain, single chain Fv, human antibody, humanized 
antibody, or Fab fragment 

In a more preferred embodiment of the present invention the antibody is labeled. 
Preferably, the label is selected from the group consisting of fluorochromes, e.g. 
fluorescein isothiocyanate (FITC), rhodamine, Texas Red, phycoerythrin, 
allophycocyanin, 6-carboxyfluorescein (6-FAM), 2\7Mjimethoxy^5-dichloro-6- 
carboxyfluorescein (JOE), 6-carboxy-X-rhodamine(ROX), 6-carboxy-2\4\7 , l 4,7- 
hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or N.N.N'.N'-tetramethyl- 
6-carboxyrhodamine (TAMRA), radioactive labels, e.g. 32 P, 35 S, 3 H; etc. The label 
may also be a two stage system, where the antibody is conjugated to biotin, haptens, 
etc. having a high affinity binding partner, e.g. avidin, specific antibodies, etc., where 
the binding partner is conjugated to a detectable label. In another preferred 
embodiment of the present invention the label is a toxin, radioisotope, or fluorescent 
label. 
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huntingtin aggregates isolated from the transfected cells of (b) In comparison with the 
amount of huntingtin aggregates isolated from the transfected cells of (a) is indicative 
of a protein's activity as an inhibitor of huntingtin aggregation. Preferably, the 
huntingtin protein or protein fragment of step (a) is HD169Q68 or HD510Q68 or 
HdexQ51. 

The term "promotes" means increasing the amount of huntingtin aggregation. 

Preferably said huntingtin protein or the fragments thereof is selected from the 
proteins listed in table 1 and/or 2. Preferably said insoluble aggregates are isolated 
by using a filter retardation method comprising lysing cells and boiling in 2%SDS for 
5min in the presence of 100mM DDT followed by a filtration step. The presence of 
aggregates is detected by using specific antibodies. 

In a preferred embodiment of the present invention, determining the amount of 
insoluble huntingtin is performed by using light scattering or size exclusion 
chromatography. In another preferred embodiment of the present invention prior to 
step (d) the cells are treated with an ionic detergent. In yet another preferred 
embodiment of the methods of the present invention, the huntingtin aggregates are 
filtered onto a membrane. 

The present invention also relates to a method for identifying compounds affecting, 
e.g. interfering or enhancing the interaction of huntingtin or of a direct or indirect 
interaction partner of huntingtin comprising (a) contacting interacting proteins 
selected from the group of interacting proteins listed in table 1 in the presence or 
absence of a potential modulator of interaction; and (b) identifying compounds 
capable of modulating said interaction. The contacting is performed under conditions 
that permit the interaction of the two proteins. Sometimes more than two interacting 
proteins might be present in a single reaction as additional interaction partners of 
those listed under table 1, can be tested. However, the compound may also be a 
small molecule. Preferably said compounds are antibodies directed to huntingtin or to 
said interaction partner listed in table 1, wherein these antibodies are capable of 
interfering with the interaction with huntingtin. Alternatively, said compound is a 
peptide fragment of 10 to 25 amino acid residues of an interaction partner listed in 
table 1, wherein said peptide fragment is capable of interfering with the interaction 



wrth huntingtin. In a more preferred embodiment of the present invention seid 
antibody is an antibody directed to GIT1. in another more preferred embodiment of 
the invention, said peptide fragment is a peptide fragment of GIT1 of 10 to 25 
capable of interfering with the interaction of GIT1 with huntingtin. Said interfering 
pept.de may contain additional modifications in order to Increase cellular uptake 
solubility or to increase stability. Such modifications are known to the pereon skilled 
.n the art and need not be listed here in detail. In a prefen-ed embodiment of the 
present invention, the methods for identifying a compound further comprise the steps 
of modeling said compound by peptidomendcs and chemically synthesizing the 
modeled compound. 

in another preferred embodiment of the present invention, the methods for identifying 
a compound further comprise producing said compound. In yet another preferred 
embodiment of the present invention, the method for identifying said compound 
further comprise modifiylng to achieve (i) modified site of action, spectmm of activity 
organ specificity, and/or (ii) Improved potency, and/or (II) decreased toxicity 
(improved therapeutic index), and/or <iv) decreased side effects, and/or (v) 

modfied onset of therapeutic action, duration of effect, and/or (vi) modified 
Pharmakinetic parameters (resorption, distribution, metabolism and excretion), and/or 
(v..) modified physico-chemical parameters (solubility, hygrescopicity, color taste 
odor stability, state), and/or (viii) improved general specHicity. organ/tissue 
speofiaty. and/or (ix) optimized application form and route by (I) esterification of 
carboxyl groups, or (ii) esterification of hydrexyl groups with carbon acids, or (Hi) 
esterification of hydroxy, groups to. e.g. phosphates, pyrophosphates or sulfates or 
hem, succinates, or (iv) formation of pharmaceutical* aoceptab.e salts or (v) 
formation of pharmaceutical* acceptable complexes, or (vi) synthesis of 
Phamtacologically active polymers, or (vii, introduction of hydrophilic moieties, or (vlli) 
mtreducon/exchange of subsutuents on aromates or side chains, change of 
subsMuen. pattern, or (Ix) modification by induction of isosteric or bioiaosteric 
mo,eties, or (x) synthesis of homologous compounds, or (xi) introduction of branched 
s,de chains, or (xii) conversion of alkyl substitoents to cyclic analogues, or (xiii) 
derivation of hydrexyl greup to keta.es, acetates, or (xiv) N-acetylafen to amides 
Phenylcarbama.es, or (xv, synthesis of Mannich bases, imines, or transformation of 



2 8 



ketones or aldehydes to Schifrs bases, oxfmes. aceteles. ketales, enolesters 
oxazolidines, thiozolidines or combinations thereof. 

The present invention also relates to a method of diagnosing Huntington's disease in 
a biological sample comprising the steps of (a) contacting the sample with an 
antibody specific for a protein of table 1 or 2 or an antibody specific for the protein 
complex of the present invention; and (b) detecting binding of the antibody to a 
protein complex, wherein the detection of binding is indicative of Huntington's 
dtsease or of a predisposition to develop Huntington's disease. Preferably, binding is 
detected by measuring the presence of a fluorescent label bound to the p rotein 
complex. 

In a preferred embodiment of the present invention's method p rot ein complex 
~nte,ne (a) G.T1 or (b) said antibody is specmc for a protein comptex containing 

The present invention also relates to a diagnostic agent/composition comprising the 
nucleic acid molecule of the present invention, the (polypeptide of the present 
invention including/or the (polypeptide mentioned in table 1 or 2, the anflbody of the 
Present invention, an antibody specifically reacting with a protein selected torn table 
2 and/or a protein selected from table 2. 

Moreover, me present Invention also reta.es to a pharmaceuflca. composition 
comprising the nucteic add molecule of the present invention, the (poly )P apflde of toe 
present invention, the interfering compound identified with a method of toe present 
invention, the antibody of the present invention, an antibody specifically reacting with 
a protein selected from table 2 and/or a protein selected from table 2. 

The pharmaceuflca. composition will be formulated and dosed in a fashion consistent 
w,to good medical practice, taking into account the clinical condition of the individual 
patient, toe site of delivery of the pharmaceutical composition, the method of 
administration, the scheduling of administration, and other (actors known to 
practitioners. The "etfectiva amount- of toe pharmaceuflca, composition for purposes 
herein is thus determined by such considerations. - 
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As a general proposition, the total pharmaceutical^ effective amount of 
pharmaceutical composition administered parenteral^ per dose will be in the range 
of about 1 ug protein /kg/day to 10 mg protein /kg/day of patient body weight, 
although, as noted above, this will be subject to therapeutic discretion. More' 
preferably, this dose is at least 0.01 mg protein /kg/day, and most preferably for 
humans between about 0.01 and 1 mg protein /kg/day for the peptide. If given 
continuously, the pharmaceutical composition is typically administered at a dose rate 
of about 1 ug/kg/hour to about 50 ug/kg/hour, either by 1-4 injections per day or by 
continuous subcutaneous infusions, for example, using a mini-pump. An intravenous 
bag solution may also be employed. The length of treatment needed to observe 
changes and the interval following treatment for responses to occur appears to vary 
depending on the desired effect. 

Pharmaceutical compositions of the invention may be administered orally, rectally, 
parenteral^, intracistemally, intravaginally, intraperitoneally. topically (as by powders! 
ointments, drops or transdermal patch), bucally, or as an oral or nasal spray. By 
"pharmaceutically acceptable carrier" is meant a non-toxic solid, semisolid or liquid 
filler, diluent, encapsulating material or formulation auxiliary of any type. The term 
"parenteral" as used herein refers to modes of administration which include 
intravenous, intramuscular, intraperitoneal, intrasternal, subcutaneous and 
intraarticular injection and infusion. 

The pharmaceutical composition is also suitably administered by sustained-release 
systems. Suitable examples of sustained-release compositions include semi- 
permeable polymer matrices in the form of shaped articles, e.g., films, or 
mirocapsules. Sustained-release matrices include polylactides (U.S. Pat. No. 
3,773,919, EP 58,481), copolymers of L-glutamic acid and gamma-ethyl-L-glutamate 
(Sidman, U. et al., Biopolymers 22:547-556 (1983)). poly (2- hydroxyethyl 
methacrylate) (R. Langer et al.. J. Biomed. Mater. Res. 75:167-277 (1981). and R. 
Langer, Chem. Tech. 72:98-105 (1982)), ethylene vinyl acetate (R. Langer et al.. Id.) 
or poly-D- (-)-3-hydroxybutyric acid (EP 133.988). Sustained-release pharmaceutical 
composition also include liposomally entrapped protein, antibody, (polypeptide, 
peptide or nucleic acid. Liposomes containing the pharmaceutical composition are 
prepared by methods known per se: DE 3.218.121; Epstein et al.. Proc. Natl. Acad. 



Scf. (USA) 82:3688-3692 (1985); Hwang et al., Proc. Natl. Acad. Sci. (USA) 774030- 
4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 142,641; Japanese 
Pat. Appl. 83-118008; U.S. Pat. Nos. 4,485.045 and 4,544,545; and EP 102 324 
Ordinarily, the liposomes are of the small (about 200-800 Angstroms) unilamellar 
type ,n which the lipid content is greater than about 30 mol. percent cholesterol the 
selected proportion being adjusted for the optimal therapy. 

For parenteral administration, in one embodiment, the pharmaceutical composition is 
formulated generally by mixing it at the desired degree of purity, in a unit dosage 
■njectable form (solution, suspension, or emulsion), with a pharmaceutical* 
acceptable earner, i.e., one that is non-toxic to recipients at the dosages and 
concentrations employed and is compatible with other ingredients of the formulation 
For example, the formulation preferably does not include oxidizing agents and other 
compounds that are known to be deleterious to (poly)peptides. 

Generally, the formulations are prepared by contacting the components of the 
Pharmaceutical composition uniformly and intimately with liquid carriers or finely 
d.v,ded solid carriers or both. Then, if necessary, the product is shaped into the 
desired formulation. Preferably the carrier is a parenteral earner, more preferably a 
solut.cn that is isotonic with the blood of the recipient. Examples of such earner 
vehicles include water, saline, Ringer's solution, and dextrose solution. Non-aqueous 
vehicles such as fixed oils and ethyl oleate are also useful herein, as well as 
l.posomes. The carrier suitably contains minor amounts of additives such as 
substances that enhance isotonicity and chemical stability. Such materials are non- 
tox,c to recipients at the dosages and concentrations employed, and include buffers 
such as phosphate, citrate, succinate, acetic acid, and other organic acids or their 
salts; antioxidants such as ascorbic acid; low molecular weight (less than about ten 
rescues) (polypeptides, e.g., polyarginine or tripeptides; proteins, such as serum 
albumin, gelatin, or immunoglobulins; hydrophilic polymers such as 
polyvinylpyrrolidone; amino acids, such as glycine, g.utamic acid, aspartic acid or 
arg,n in e; monosaccharides, disaccharides, and other carbohydrates including 
cellulose or its derivatives, glucose, manose, or dextrine; chelating agents such as 
SU9ar a,Coho,s such as mannito ' or sorbitol; counterions such as sodium- 
and/or nonionic surfactants such as polysorbates, poloxamers, or PEG The 



prote.nacous components of the pharmaceutical composition are typically formulated 
in such vehicles at a concentration of about 0.1 mg/ml to 100 mg/ml, preferably 1-10 
mg/ml. at a pH of about 3 to 8. It will be understood that the use of certain of the 
foregoing excipients. carriers, or stabilizers will result in the formation protein or 
(poly)peptide salts. 

The components of the pharmaceutical composition to be used for therapeutic 
administration must be sterile. Sterility is readily accomplished by filtration through 
sterile filtration membranes (e.g.. 0.2 micron membranes). Therapeutic components 
of the pharmaceutical composition (poly)peptide compositions generally are placed 
■nto a container having a sterile access port, for example, an intravenous solution 
bag or vial having a stopper pierceable by a hypodermic injection needle. 

The components of the pharmaceutical composition ordinarily will be stored in unit or 
mult,-dose containers, for example, sealed ampoules or vials, as an aqueous solution 
or as a lyophilized formulation for reconstitution. As an example of a lyophilized 
formulation, 10-ml vials are filled with 5 ml of sterile-filtered 1% (w/v) aqueous protein 
solut,on. and the resulting mixture is lyophilized. The infusion solution is prepared by 
reconstituting the lyophilized protein using bacteriostatic Water-for-lnjection. 

The invention also provides a pharmaceutical/diagnostic pack or kit comprising one 
or more containers filled with one or more of the ingredients of the 
Pharmaceutical/diagnostic compositions of the invention. Associated with such 
containers) can be a notice in the form prescribed by a governmental agency 
regu.at.ng the manufacture, use or sale of pharmaceuticals or biological products 
wh,ch notice reflects approval by the agency of manufacture, use or sale for human 
adm.n.stration. ,n addition, the (polypeptides of the components of the 
Pharmaceutical composition invention may be employed in conjunction with other 
therapeutic compounds. 

Finally, the present invention relates to the use of the nudeio acid molecule of the 
present invention, the interfering compound identified wHh a method of the present 
■nvention, the (polypeptide of the present invention Including/or the (po.y)pep,ide 
mentioned in tab.e 1 or 2. the antibody of the prasent invenhon. an anybody 
specncally reacting with a protein selected f rom table 2 and/or a protein selected 



from table 2 for the preparation of a pharmaceutical composition for the treatment of 
Huntington's disease. 



Tables: 
Table 1: 



PROTEIN-PROTEIN INTERACTIONS IN 
I THE PP| OF HUNTINGTIN 


| Da IIS {UoU) 


Preys (AD) 


I RAPri'i 

| DMr\Uj 


PLIP 


J tr iCa 


EF1G 


I wni 7 
| nuu 


CA150 


\ MD1.7 


HIP1 


1 uinn t 
1 rlUI.7 


HYPA 


[ HD1.7 


SH3GL3 


[HDexQ20 


CA150 


[MDexQ20 


HYPA 


| nDexQ20 


SH3GL3 


| nDexQol 


CA150 


| nDexQol 


HYPA ! 


J nDexQ51 


SH3GL3 


| mp53 


p53 


| mpo3 


PIASy 1 


I DIAOw 

| KIAoy 


SUMO-2 


I PI AQw ^ 


SUMO-3 


I \i\KA 
VIM 


NEFL 


| VIIVI 


VIMc 


I RARH1 


BAIP1 


I RARm 

I DAr\U I 


BAIP2 


BARD1 


BAIP3 


[BARD1 


FE21 


BARD1 


GIT1 


[BARD1 


HB01 1 


IBARD1 


HIP5 


BARD1 


HZFH 


BARD1 | 


KAP 


BARD1 r 


nHAP1 


BARD1 t 


>IAG4 


BARD1 F 


'lASy 


BARD1 F 


>TN 


BARD1 £ 


•ETBD1 
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BARD1 ; 


2HX1 


CLH-17 I 


<u70 


CLK1 


PIASy 


GADD45G 


BAIP3 


GADD45G 


CGI-125 


GADD45G 


CGI-74 


GADD45G 


EF1A 


GADD45G 


EF1G 


GADD45G 


G45IP1 


GADD45G 


G45IP2 


GADD45G 


G45IP3 


GADD45G 


HIP16 


GADD45G 


HIP5 


GADD45G 


LUC7B1 


GADD45G 


PIASy 


GADD45G 


PLIP 


GADD45G 


PTN 


GADD45G 


PTPK 


hADA3 


BAIP1 


hADA3 


Ku70 


hADA3 


MAGEH1 


hADA3 


PIASy 


HD1.7 


CGI-125 


HD1.7 


DRP-1 


HD1.7 


FEZ1 


HD1.7 


GIT1 


HD1.7 


HIP11 


HD1.7 


HIP13 


HD1.7 


HIP15 


HD1.7 


HIP16 


HD1.7 


HIP5 


HD1.7 


HZFH 


HD1.7 


IKAP 


HD1.7 


Ku70 


HD1.7 


PIASy 


HDdl.O 


FEZ1 


HDdl.O 


GIT1 


HDdl.O 


IKAP 


HDd1.3 


HZFH 
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HDd1.3 


IKAP 


HDd1.3 


Ku70 


HDd1.3 


PIASy 


HDexQ20 


CGI-125 


HDexQ20 


HIP13 


HDexQ20 


HP28 


HDexQ20 


PFN2 


HDexQ51 


CGI-125 


HDexQ51 


HIP13 


HDexQ51 


HIP15 


HDexQ51 


HP28 


HDexQ51 


PFN2 


HIP2 


PIASy 


HIPS 


APP1 


HIP5 


BAIP1 


HIP5 


BAIP2 


HIP5 


CGI-74 


HIPS 


FEZ1 


HIPS 


GIT1 


HIPS 


HB01 


HIPS j 


HMP 


HIPS 


KPNA2 


HIP5 


mHAP1 


HIPS 


NAG4 


HIP5 


PLIP 


IMPD2 


PIASy 


KPNB1 


PIASy 


KPNB1 


PTN 


mp53 


H2FH 


mp53 


ZHX1 


PIASy 


MAPIc3 


TAL1 


ZHX1 


TCP1G 


Ku70 


VIM 


ALEX2 


VIM 


BAIP1 


VIM 


DRP-1 


VIM 


G45IP1 


VIM 


B01 


VIM 


HSPC232 
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VIM 


HZFH 


VIM 


PIASy 


VIM 


SETBD1 


VIM 


SH3GL3 


ZNF33B 


mHAP1 


2NF33B 


2HX1 



Table 2 Classification of proteins in Huntington 's disease inte raction n^n/ 



NAME 



Huntingtin fragments 
HD1.7 hunlingtin 
HDdl.O huntingtin 
HDd1.3 huntingtin 
HdexQ2Q huntingtin 
HcfexQ51 huntingtin 

Transcriptional control and DNA maintenance 
BARD1 BRCA1 associated ring domain protein 1 
CA1 50 putative transcription factor CA1 50 

GADD45G growth arrest and DNA damage inducible protein GADD45 gamma 
hADA3 ADA3like protein 

histone acetyltransferase binding to ORC 
protein inhibitor of activated STAT protein gamma (PIASy) 
huntingtin interacting protein HYPA/FBP1 1 (fragment) 
zinc finger heficase HZFH 
IKK complex associated protein 
ATP dependent DNA heKcase II, 70 kDa subunit 
bromodomain containing protein NAG4 
cellular tumor antigen p53 
ceiiular tumor antigen p53 (C-termlnus) 
cellular tumor antigen p53 (mouse) 
cPLA2 interacting protein 

histone-lysine N-methyltransferase, H3 lysine-9 specific 4 
ubiquitin (ike protein SMT3A (SUMO-2) 
ubiquiUn like protein SMT3B (SUMO-3) 
zinc finger homeobox protein 2HX1 
zinc finger protein 33b 



FUSION ACCESSION 



DBD 
DBD 
DBD 
DBD 
DBD 



P42858 
P42858 
P42858 
P42858 
P42858 



100 
100 
100 
98 
75 



1-506 
1-320 
166-506 
1-90 
1-82 



HBOl 

PIASy 

HYPA 

HZFH 

IKAP 

Ku70 

NAG4 

p53 

p53c 

mp53 

PLIP 

SETDB1 

SUMO-2 

SUMO-3 

ZHX1 

ZNF33B 



APP1 
CLH-17 
HP28 
mHAP1 
HIP1 
HMP 
MAP1lc3 
NEFU 



amyloid like protein 1 precursor 
clathrin heavy chain 1 
axonemal dynein light chain (hp28) 
huntingtin associated protein 1 (mouse) 
huntingtin interacting protein 1 
mitofilin 

microtubule associated proteins 1A/1B light chain 3 
light molecular weight neurofilament protein 



loc 

N.C 
N f C 
N. C 
N. C 
N, C 



DBD 


Q99728 


99 


1-379 


N 


AD 


014776 


93 


299-629 


N 


DBD 


095257 


100 


18-159 


N 


DBD 


075528 


100 


23S432 


N 


AD 


095251 


100 


1-611 


N 


AD # DBD 


Q8N2W9 


100 


5-510 


N.C 


AD 


075400 


100 


8-422 


C, N 


AD, DBD 


Q9Y4I0 


100 


1830-2000 


N 


AD 


095163 


100 


1207-1332 


N.C 


AD 


P12956 


100 


298-608 


N 


AD 


Q9NPI1 


100 


94-651 


N 


AD 


P04637 


100 


1-393 


N 


AD 


P04637 


100 


248-393 


N 


DBD 


P02340 


100 


73-390 


N 


AD 


095624 


100 


5-461 


N.PN 


AD 


Q15047 


100 


1023-1291 


N 


AD 


P55854 


100 


1-103 


CN 


AD 


P55855 


100 


1-95 


C.N 


AD 


Q9UKY1 


100 


145-873 


N 


DBD 


Q8NDW3 


100 


527-778 


N 


AD 


P51693 


100 


243-555 


PM, EC 


DBD 


Q00610 


100 


1-289 


PM.V 


AD 


Q9BQZ6 


100 


3-258 


CN 


AD 


035668 


100 


3^71 


C, EE 


AD 


000291 


100 


245-631 


C. GN 


AD 


Q16891 


100 


212-758 


Mit 


AD 


Q9H491 


100 


58-170 


CN, MT 


AD 


Q8IU72 


100 


1-543 


CN. IF 
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PFN2 

PTN 

SH3GL3 

KPNA2 

KPNB1 

VIM 

VIMc 



profilin II 

pleiotrophin precursor (exon 1 1ncluded) 
SH3 containing GRB2 like protein 3 
karyopherin alpha-2 subunit 
karyopherin beta-1 subunit 
vlmentln 

vlmentin (C-terminus) 



Cell signaling and fate 

ALEX2 armadillo repeat protein ALEX2 

CLK1 protein kinase CLK1 

FE21 fasciculation and elongation protein 2eta 1 

GIT1 ARF GTPase activating protein GIT1 

proteln-tyros ine phosphatase kappa precursor 

Cellular metabolism ~~ " 

DRP-1 dlhydropyrimldlnase related protein 1 (C-terminus) 
IMPD2 inoslne-S'-monophosphate dehydrogenase 2 
TAL1 transaldolase 



AD 


P35080 


100 


1-140 


CN 


AD 


P21246 


100 


1-168 


PM. EC 


AD 


Q99963 


100 


3-347 


V 


AD 


P52292 


100 


141-529 


C.N 


DBD 


Q14974 


100 


668-876 


C.N 


DBD 


P08670 


100 


1-466 


CN, IF 


AD 


P08670 


100 


190-466 


CN. IF 


AD 


060267 


100 


127-632 


C f PM 


DBD 


P49759 


100 


209-484 


N 


AD 


Q99689 


100 


131-392 


C, PM 


AD 


Q9Y2X7 


98 


249-761 


PM, V 
PM, AJ 


AD 


Q15262 


100 


1227-1439 



Protein synthesis and turnover 



AD 
DBD 
DBD 



Q14194 
P12268 
P37837 



100 
100 
100 



EF1A 

EF1G 

EF1GC 

HIP2 

TCPG 



translation elongation factor 1 alpha 1 
elongation factor 1 gamma 
elongation factor 1 gamma (Oterminus) 
ubiqultln conjugating enzyme E2-25 kDa 
T-complex protein 1. gamma subunit 



Uncharacterized proteins 



AD 
AD. DBD 

AD • 
DBD 
DBD 



P04720 
P26641 
P26641 
P27924 
P49368 



100 
100 
100 
100 
100 



BAIP1 
BAIP2 
BAIP3 
CGI-74 
CGI-125 
G45IP1 
G45IP2 
G45IP3 
H1P5 
HIP11 
HIP13 
HIP15 
HIP16 
HSPC232 
LUC7B1 
MAGEH1 



BARD1 interacting protein 1[similarto RIKEN cDNA 1810018M11] 
BARD1 interacting protein 2 [hypothetical protein] 
BARD1 interacting protein 3 [hypothetical protein] 
CGI-74 protein 
CGI-125 protein 

GADD45G interacting protein 1 [hypothetical protein] 

GADD45G interacting protein 2 [B2 gene partial cDNA. clone B2E] 

GADD45G interacting protein 3 [OK/SW-CL.1 6] 

huntlngtin interacting protein 5 [hypothetical protein KIAA1377] 

huntingtin interacting protein 11 [hypothetical protein] 

huntingtin interacting protein 13 [metastasis suppressor protein] 

huntingtin interacting protein 15 [similar to KIAA0443 gene product] 

huntingtin interacting protein 16 [similar to KIAA0266 gene product] 

HSPC232 

putative SR protein LUC7B1 (SR+89) 
.melanoma associated antigen H1 



AD 
AD 
AD 
AD 
AD 
AD 
AD 
AD 
AD. DBD 
AD 
AD 
AD 
AD 
AD 
AD 
AD 



Q9BS30 
Q9H0I6 
Q96HT4 
Q9Y383 
Q9Y3C7 
Q9H0V7 
Q9NYA0 
Q8NI70 
Q9P2H0 
Q96E29 
Q96RX2 
Q96D09 
Q9BVJ6 
Q9P0P6 
Q9NQ29 
Q9H213 



100 

100 

100 

100 

100 

100 

100 

100 

100 

100 

100 

100 

100 

92 

99 

100 



345-572 


C 


34-514 


C 


3-337 


C 


294-462 


C, MT 


2-437 


C. MT 


123-437 


C, MT 


1-200 


C.N 


252-544 


C 


1-226 


UN 


107-684 


UN 


152^*36 


UN 


159-270 


UN 


1-131 


UN 


1-340 


UN 


566-926 


UN 


3-134 


UN 


445-988 


N. C 


176-328 


UN 


512-755 


UN 


663-838 


UN 


585-771 


UN 


1-319 


UN 


116-371 


ER 


1-219 


UN 



S^sSiT^^ •*-■?" «"»■* DBD, ONA b.„d.n 3 domain; AJ 

Intermediate filaments; GN* Golgl network- MIL StoSSSK Mr"™? ^ B 1?° som ^- ER « endoplasmic reticulum; If| 
perlnucleanUN.unknoWniV.veslclesaLdataba^ mlc^tubules; N, nucleus; PM. plasma membrane; PM, 
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Table 3 New proteins in Huntington's disease interaction network 


ID NAME 


FUSION 


ACCESSION 


I DEN 


aa MATCH 


LOC 


Transcriptional control and DNA maintenance 












BARD1 BRCA1 associated ring domain protein 1 


DBD 


Q99728 


99 


1-379 


N 


CA150 putative transcription factor CA150 


AD 


014776 


93 


299-629 


N 


Cell signaling and fate 












GIT1 ARF GTPase activating protein GIT1 


AD 


Q9Y2X7 


98 


249-761 


PM.V 


HSPC232 HSPC232 


AD 


Q9P0P6 


92 


1-319 


UN 


LUC7B1 putative SR protein LUC7B1 (SR+89) 


AD 


Q9NQ29 


99 


116-371 


ER 



Abbreviations: aa, amino acids; IDEN, Identity; LOC, localisation; AD, activation domain; DBD, DNA binding domain; AJ, 
adherens junctions; C, cytosol; CN, cytoskeleton; EC, extracellular space; EE, early endosomes; ER, endoplasmic reticulum; IF, 
Intermediate filaments; GN, Golgi network; Mit, mitochondria; MT, microtubules; N, nucleus; PM, plasma membrane; PN, 
perinuclear; UN, unknown; V, vesicles; [ ] f database annotation 



Table 4: 



New protein-protein interactions, found 


Baits (DBD) 


Preys (AD) 


BARD1 


BAIP1 


BARD1 


BAIP2 


BARD1 


BAIP3 


BARD1 


FEZ1 


BARD1 


GIT1 


BARD1 


HB01 


BARD1 


HIP5 


BARD1 


HZFH 


BARD1 


IKAP 


BARD1 


mHAP1 


BARD1 


NAG4 


BARD1 


PIASy 


BARD1 


PTN 


BARD1 


SETBD1 


BARD1 


ZHX1 


CLH-17 


Ku70 


CLK1 


PIASy 


GADD45G 


BAIP3 


GADD45G 


CGI-125 


GADD45G 


CGI-74 
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\Jr\LJ U*f JO 


EF1A 




EF1G 




G45IP1 


GAnnafifs 


G45IP2 




G45IP3 


0/AUU40VJ 


HIP16 




HIPS 


v3nUU40b 


LUC7B1 




PIASy 


oAUUhOG 


PLIP 


WMJD45G 


PTN 


bAUU40b 


PTPK 


H Ar\ A o 


BAIP1 


l-i A A 1 


Ku70 


K APi AO 


MAGEH1 


It A A o 

nADA3 


PIASy 


MU1.7 


CGI-125 ! 


rtU 1.7 


DRP-1 


nU 1.7 


FEZ1 


HD1.7 


GIT1 




HIP11 


nUi./ 


HIP13 


nUi .7 


HIP15 


nu i.f 


HIP16 


\AT\A 7 


HIP5 


nul.7 


H2FH 


nui./ 


IKAP 


nni 7 
nui.f 


Ku70 


uni 7 


PIASy 


nUai.O 


FE21 


nuui.u 


GIT1 


nuu i .u 


IKAP 


nuu 1 .0 


HZFH 


HDd1.3 


IKAP 


HDd1.3 


Ku70 


HDd1.3 


PIASy 


HDexQ20 


CGI-125 


HDexQ20 


HIP13 


HDexQ20 


P28 



40 



HDexOM 


PFN2 


MUCAVkJ 1 


CGM25 


n ucauj i 


Mini 0 

HIP13 




HIP15 


nuexwo i 


HP28 


nuexvjon 


PFN2 


MIP9 


PlASy 


1-1 IDC 


APP1 


LJi DC 


BA1P1 


UIDC 

nlPO 


BAIP2 


UIDR 


CGI-74 


UJ IDC 

nlPo 


FEZ1 


m I DC 

nlPo 


GIT1 


Mine 

nlPo 


HB01 


HIP5 


HMP 


HIP5 


KPNA2 


nlPo 


mHAP1 


nlPo 


NAG4 


nlPo 


PLIP 


ik JDP\0 

IMPD2 


PlASy 


i/nkin a. 

KPNB1 


PlASy 


KPNB1 


PTN 


mp53 


H2FH 


mpo3 


2HX1 


Dl A 0» . 

PlASy 


MAPIc3 


XAI 4 

I ALT 


ZHX1 


Tr»n<i /"» 
I OPIC3 


Ku70 


Wilt A 
VIM 


ALEX2 


WIMI 

VIM 


BAIP1 


VIM 


DRP-1 


\/IM 
VIM 


G45IP1 


\/IM 
VIM 


HB01 


VIM 


HSPC232 


VIM 


HZFH 


VIM 


PlASy 


VIM 


SETBD1 


VIM 


SH3GL3 


ZNF33B 


mHAP1 


ZNF33B 


ZHX1 



Table 5: 

* Aarskog syndrome 

* Achromatopsia 

* Acoustic neuroma 

* Adrenal hyperplasia 

* Adrenoleukodystrophy 

* Agenesis of corpus callosum 

* Aicardi syndrome 

* Alagille syndrome 

* Albinism 

* Alopecia areata 

* Alstrom syndrome 

* Alpha-1 -antitrypsin deficiency 

* Alzheimer 

* Ambiguous genitalia 

* Androgen insensitivity syndrome(s) 
*Anorchia 

* Angelman syndrome 

* Anopthalmia 

* Apert syndrome 

* Arthrogryposis 

* Ataxia 

* Autism 

* Bardet-Biedl syndrome 

* Basal cell carcinoma 

* Batten disease 

* Beckwith-Wiedemann syndrome 

* Blepharophimosis 

* Blind 

* Branchio-Oto-Renal (BOR) syndrome 
Canavan 



£J2£ ( f ? Xia * e,ar, 9 |ecta sia. basal cell nevus, brain /spine, breast colon / bowel 
leukemia / lymphoma, lung, melanoma / skin, multiple endoOT ? neTeo3as^ Si 

Pigme^ar 19 ' retinob,astoma « von H^eTu^u^^ ~+ 



* Cardiofaciocutaneous syndrome 

* Celiac sprue 

* Charcot-Marie-Tooth 

* CHARGE association 



2on^° S ? me anomalies ~ tn 'somy, deletions, inversions, duplications 
U ^^X^'^^' 5P-), 6, 8m >Twsomy 9, Op.), 

triUm;?i« 5 V' 3 (t "!2 my 13 ' Patau >■ 15 - 16 (mosaic), 18 (18q-. 18p- rinq 18 
risomy 18, tetrasomy 18p, Edwards), 21 (Down syndrome trisomy 211 22 X &V 
[sex chromosome anomalies, Klinefelter (W, oth'er), tJS^^^^ 



* Cleft lip and/or cleft palate 

* Cockayne syndrome 

* Coffin-Lowry syndrome 

* Coffin-Siris syndrome 

* Congenital heart defects 



* Connective tissue conditions 

* Cooley anemia 

* Conjoined twins 

* Cornelia de Lange syndrome 

* Costello syndrome 

* Craniofacial conditions 

* Cri-du-Chat (5p-) 

* Cystic fibrosis 

* Cystinosis 

* Cystinuria 

* Dandy-Walker syndrome 

* Deaf / hard of hearing 

* Dermatological (skin) conditions 

* Developmental delay / mental retardation 

* DiGeorge syndrome 

* Down syndrome 

* DRPLA 

* Dubowitz syndrome 

* Dwarfism/ short stature 

* Dysautonomia 

* Dystonia 

* Ectodermal dysplasia 

* Ehlers Danlos syndrome 

* Endocrine Conditions 

* Epidermolysis bullosa 

* Facial anomalies, disfigurement 

* Fanconi anemia 

* Fetal alcohol syndrome and effects 

* FG syndrome 

* Fragile-X syndrome 

* Friedreich ataxia 

* Freeman Sheldon syndrome 

* Galactosemia 

* Gardner syndrome 

* Gastroenterology conditions 

* Gaucher disease 

* Glycogen storage disease 

* Goldenhar syndrome 

* Gorlin syndrome 

* Hallerman Streiff syndrome 

* Hearing problems 

* Heart conditions 

* Hemochromatosis 

* Hemophilia 

* Hemoglobinopathies 

* Hereditary hemorrhagic telangiectasia 

* Hereditary spastic paraplegia 

* Hermansky-Pudlak syndrome 

* Hirschsprung anomaly 

* Holoprosencephaly 
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* Huntington disease 

* Hydrocephalus 

* Ichthyosis 

* Immune deficiencies 

* Incontinentia pigmenti 

* Infertility 

* Intestinal problems 

* Joseph disease 

* Joubert syndrome 

* Kabuki syndrome 

* Kidney conditions 

* Klinefelter syndrome 

* Klippel-Feil syndrome 

* Klippel-Trenaunay syndrome 

* Langer-Giedion syndrome 

* Laurence-Moon-Biedl syndrome 

* Leber Optic Atrophy 

* Leigh disease 

* Lesch-Nyhan syndrome 

* Leukodystrophy [Adrenoleukodystrophy (ALD), Alexanders Disease, CADASIL 
(Cerebral Autosomal Dominant Arteriopathy with Subcortical Infarcts & 
Leukoencephalopathy), Canavan Disease (Spongy Degeneration), 
Cerebrotendinous Xanthomatosis (CTX), Globoid Cell (Krabbes) Leukodystrophy 
Metachromatic Leukodystrophy (MLD), Ovarioleukodystrophy , Pelizaeus- 
Merzbacher Disease, Refsum Disease, van der Knaap syndrome, Zellweger 
syndrome] a 

* Limb anomalies [missing arm(s) or leg(s), Poland anomaly, other] 
Lissencephaly [Isolated Sequence (ILS), X-Linked (XLIS), Subcortical Band 
Heterotopia (SBH), Miller-Dieker syndrome (MDS), Microcephaly, 
Microlissencephaly (MLIS), Norman-Roberts syndrome (NRS), With Cerebellar 
Hypoplasia (LCH), Polymicrogyria (PMG), Schizencephaly (SCH), Muscle-Eye- 

^ Brain (MEB) Disease, and Walker-Warburg syndrome (WWS), 17p13.3 deletion] 
Liver conditions (biliary atresia, Alagille syndrome, alpha-1 antitrypsin, tyrosinemia, 
neonatal hepatitis, Wilson disease) K ' 

* Lowe syndrome 

* Lung / pulmonary conditions 

* Lymphedema 

* Maffucci syndrome(Ollier, multiple cartilaginous enchondromatosis) 
Malignant hyperthermia 

* Maple syrup urine disease 

* Marinesco-Sjogren Syndrome 

* Marfan syndrome 

* Menke syndrome 

* Mental retardation / developmental delay 

* Metabolic conditions (carbohydrate deficient glycoprotein syndrome (CDGS) 
diabetes insipidus, Fabry, galactosemia, glucose-6-phosphate dehydrogenase 
(G6PD), fatty acid oxidation disorders, glutaric aciduria, hypophosphatemia 
Krabbe, lactic acidosis, lysosomal storage diseases, mannosidosis. maple syrup 
unne, mitochondrial, neuro-metabolic, organic acidemias, PKU, purine pyruvate 
dehydrogenase deficiency, urea cycle conditions, vitamin D deficient rickets) 
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* Miscarriage, stillbirth, infant death 

* Mitochondrial conditions (Alpers, Barth, beta-oxidation defects, carnitine deficiency, 
CPEO, Kearns-Sayre, lactic acidosis, Leber optic neuropathy, Leigh, LCAD, Luft, 
MCAD, MAD, glutaric aciduria, MERRF, MNGIE, NARP, Pearson, PHD, SCAD, 
NADH-CoQ reductase, succinate dehydrogenase, Complex III, Complex IV, COX, 
Complex V, other) 

* Moebius syndrome 

* Mucolipidosis, type IV (ML4) 

* Mucopolysaccharidosis (Hunter syndrome, Hurler syndrome, Maroteaux-Lamy 
syndrome, Sanfilippo syndrome, Scheie syndrome, Morquio syndrome, other) 

* Multiple hereditary exostoses 

* Muscular dystrophy /atrophy (neuromuscular conditions including: Duchenne, 
facioscapulohumeral, Charcot Marie Tooth, spinal muscular atrophy, other) 

* Myotonic dystrophy 

* Nager & Miller syndromes 

* Nail Patella syndrome 
* * Narcolepsy 

™ * Neurologic conditions (neuro-metabolic, neurogenetics, neuromuscular, other) 

* Neurofibromatosis (von Recklinghausen) 

* Neuromuscular conditions 

* Niemann-Pick disease 

* Noonan syndrome 

* Opitz syndromes [Opitz-Frias, Opitz FG (Opitz-Kaveggia), Opitz-C 
(Trigonocephaly)] 

* Organic acidemias 

* Osler-Weber-Rendu syndrome 

* Osteogenesis imperfecta 

* Oxalosis & hyperoxaluria 

* Pallister-Hall syndrome 

* Pallister-Killian syndrome (tetrasomy 12p, Teschler-Nicola syndrome) 

* Parkinson's disease 

* Periodic paralysis 

* Phenylketonuria (PKU) 
^ * Polycystic kidney disease 

* Popliteal pterygium syndrome 

* Porphyria 

* Prader-Willi syndrome 

* Progeria (Werner, Hutchinson-Gilford, Cockayne, Rothmond-Thomson syndromes) 

* Proteus syndrome 

* Prune belly syndrome 

* Pseudoxanthoma elasticum (PXE) 

* Psychiatric conditions 

* Refsum disease 

* Retinal degeneration 

* Retinitis pigmentosa (retinal degenerative diseases, Usher syndrome) 

* Retinoblastoma 

* Rett syndrome 

* Robinow syndrome 

* Rubinstein-Taybi syndrome 

* Russell-Silver syndrome 
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* SBMA 
*SCA 

* Schizencephaly 

: tS^SSS^S^ <47J0CV ' 4W * 45 ' X and ~- 47 ^> 

* Sickle cell anemia 

* Skeletal dysplasia 

I f m 'I!?~l-f m,i " 0pitz syndrome (RHS syndrome) 
Smitn-Magenis syndrome (17p-) 

* Sotos syndrome 

* Spina bifida (myelomeningocele, neural tube defects) 

* &Tr^tt rdni9 ' Hoffman - 

* Sturge-Weber 

* Canfvan, "&Z$ 6 ' 0 *" < d > reautoTOm,a . Gaucher, Niemann Pick. 

* Thalassemia (Cooley anemia) 

* Thrombocytopenia absent radius syndrome 
Tourette syndrome 

* Treacher Collins syndrome (craniofacial) 

* *EXfi££ 9 - ° ,her - 866 ohromos ome s *""™-> 

* Turner syndrome 

* Twins / triplets / multiple births 

* Unknown disorders 

* Urea cycle conditions 

* Usher syndrome 

* VATER association 

* Von Hippel-Lindau syndrome 

* Waardenburg syndrome 

* Weaver syndrome 

* Werner syndrome 

* Williams syndrome 

* Wilson disease (hepatolenticular degeneration) 
xeroderma pigmentosum ' 
Zellweger syndrome 



The figures show: 



Figure 1 Identification of two-hybrid interactions connected to HD. a Schematic 
representation of the screening strategy, b, Identification of interactions by systematic 
interaction mating. Upper panel: Selection of diploid yeast clones by transfer on 
m,n,mal medium lacking leucine and tryptophan (SDH). Lower panel: Two-hybrid 
selecfon of interactions on minimal medium lacking leucine, tryptophan, histidine and 



uracil (SDIV) after 5 days of growth at 30°C. The prey proteins HP28 (A5), SH3GL3 
(A7), CA150 (B9), HIP15 (B10), PFN2 (B11), HIP13 (C1), CGI125 (C12) and HYPA 
(D1) were identified as HDexQ51 interactors. 

Figure 2 Protein interaction network for Huntington' s disease, a, Matrix of 117 two- 
hybrid interactions between 21 bait and 49 prey proteins, b, Yeast two-hybrid 
interactions depicted as network using the software Pivot 1 .0. In total, 96 interactions 
and 61 distinct proteins are depicted. In addition, dimers of EF1G, VIM and p53 are 
shown. 

Figure 3 Systematic validation of two-hybrid interactions by in vitro binding 
experiments. GST-fusion proteins (baits) immobilised on glutathione agarose beads 
were incubated with COS1 cell extracts containing HA-tagged prey proteins. After 
extensive washing of the beads, bound proteins were eluted and analysed by SDS- 
PAGE and immunoblotting using anti-HA antibody. 

Figure 4 Identification of network proteins stimulating htt aggregation, a, Filter 
retardation assay. Protein extracts were prepared from HEK293 cells coexpressing 
HD169Q68 and network proteins as indicated. The aggregated proteins retained on 
the filter were detected with anti-htt antibody (CAG53b) and anti-GIT1 antibody, b, 
Coimmunoprecipitation of HD510Q68 and GIT1 from COS1 cell extracts. Extracts 
were incubated with anti-GlT1 or preimmune serum. Immunoprecipitated material 
was analysed by immunoblotting using htt- antibody 4C8 and anti-HA antibody, c, 
Coimmunoprecipitation of htt and GIT1 from human brain extracts. Protein 
complexes containing GIT1 were pulled-down with increasing amounts of anti-htt 
antibodies, but not with corresponding preimmune sera, d, Analysis of subcellular 
localisation of HD510Q68 and GIT1 by immunofluorescence microscopy. COS1 cells 
were transfected with the indicated constructs and immunolabled with 4C8 anti-htt 
antibody coupled to Cy3-conjugated antibody (red) and with anti-HA antibody 
coupled to FITC-conjugated antibody (green). Nuclei were counterstained with 
Hoechst (blue). Colocalisation of HD510Q68 and GIT1 is illustrated by yellow colour 
of the insoluble aggregates. Scale bars, 10 um. 

Figure 5 Detection of GIT1 in brains of R6/1 transgenic mice and HD patients, a, 
Sections of striatum and cortex of R6/1 mice brains labelled with anti-GIT1 and anti- 



htt (EM48) antisera. Arrows point to nuclear inclusions, b, Inclusions in cortex of HD 
patients are labelled with anti-htt (2B4) and anti-GIT1 antibodies. Arrows indicate 
neuronal inclusions, recognized by anti-htt (2B4) and anti-GIT1 antibodies. Scale 
bars, 20 urn. c, Colocalisation of GIT1 and htt in the cortex of HD patients detected 
by immunofluorescence microscopy. 

Figure 6 Amino acid sequence of the interacting proteins of the PPI of huntingtin. 
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The examples illustrate the Invention: 

Examples 1: Particular methods and material used in the Examples 

• Antibodies, strains and plasmids 

A polyclonal antibody (pAb) against GIT1 was generated by injection of affinity 
purified His 6 -tagged GIT1 (residues 368-587) into a rabbit. The htt-specific pAb 
CAG53b and HD1 were described 13,14 . Commercially available antibodies were anti- 
GST pAb (Amersham Pharmacia), anti-GIT1 pAb (Santa Cruz Biotechnology), anti- 
HA monoclonal antibody 12CA5 (mAb) (Roche Diagnostics), anti-htt pAb EM48 47 , 
anti-htt mAb 2B4 48 and anti-htt mAb 4C8 (Chemicon). As secondary antibodies for 
immunofluorescence microscopy Cy3- and FITC-conjugated IgGs (Jackson 
ImmunoResearch) were used. 

The yeast strains used as two-hybrid reporters were L40ccua [MATa his3A200 trpl- 
901 Ieu2-3,112 LYS2::(lexAop) 4 -HIS3 ura3::(lexAop) 8 -lacZ ADE2::(lexAop) 8 - URA3 
GAL4 gal80 can1 cyh2] and L40cca [MATa his3A200 trp1-910 Ieu2-3,112 ade2 
LYS2::(lexAop) 4 -HIS3 URA3::(lexAop) 8 -lacZ GAL4 gal80 can1 cyh2]. Both strains are 
derivatives of L40c 17 . Two-hybrid vector maps are available at http:/ /www.mdc- 
berlin.de/neuroDrot/labeauip.htm . Plasmids pHD510Q17 and pHD510Q68 were 
generated by insertion of fragments coding for HD510Q17 and HD510Q68 into 
pcDNA-l (Invitrogen). pHD169Q68 was derived from pHD510Q68 by deletion of the 
Xhol- Xhol fragment encoding aa 170-510 of human htt. 

• Library screening 

Plasmids encoding bait proteins were transformed into the strain L40ccua, tested for 
the absence of reporter gene activity and cotransformed with a human fetal brain 
cDNA library (Clontech). For each transformation 1 x 10 6 independent transformants 
were plated onto minimal medium lacking tryptophan, leucine, histidine and uracil 
(SDIV medium) and incubated at 30°C for 5 to 10 days. Clones were picked into 
microtitre plates using a picking robot and grown over night in liquid minimal medium 
lacking tryptophan and leucine (SDII medium). Then, they were spotted onto nylon or 
nitrocellulose membranes placed on SDIV medium plates. After incubation for 4 days 



membranes were subjected to a P-galactosidase 0-GAL) assay. Plasmids were 
prepared from positive clones and characterised by restriction analyses and 
sequencing. For retransformation assays plasmids encoding bait and prey proteins 
were cotransformed in the yeast strain L40ccua and plated onto SDIV medium. 

• Array mating screen 

Plasmids encoding bait and prey proteins were transformed into strains L40ccua and 
L40cca, respectively. L40cca clones were arrayed in 96-well microtitre plates and 
m.xed with a single L40ccua clone for interaction mating. Diploid cells were 
transferred by a robot (Beckman, Biomek® 2000) onto YPD medium plates and after 
mcubation for 24 h at 30'C, onto SDII medium plates for additional 72 h at 30°C For 
two-hybrid selection diploid cells were transferred onto SDIV medium plates with and 
without nylon or nitrocellulose membranes and incubated for 5 days at 30*C The 
nylon or nitrocellulose membranes were subjected to the p-GAL assay Positive 
clones were verified by ^transformation assays using plasmids encoding respective 
bait and prey proteins. 

• Protein expression and verification assays 

For verification experiments cDNA fragments encoding baits and preys were 
subcloned into P GEX derivatives (Stratagene) or pTL-HA « GST fusion proteins 
were expressed in E. coli BL21-codon P.usTM RP (Stratagene) and affinity purified 
on glutathione agarose beads (Sigma) using standard protocols COS1 cells were 
transfected with mammalian expression plasmids and lysed as described 18 For in 
vrtro binding assays, 30 ug of GST or GST fusion protein were immobilized on 
glutathione agarose beads and incubated with 500 ug protein extract prepared from 
COS1 cells expressing a HA-tagged fusion protein for 2 h at 4»C in binding buffer [50 
mM HEPES pH 7.4, 150 mM NaCI, 10% glycerol, 1 % NP-40. 1 mM EDTA 20 mM 
NaF, 1 mM DTT, 0.1 % Triton X-100, protease inhibitors (Roche Diagnostics)] After 
centnfugation and extensive washing of the beads bound proteins were eluted and 
analysed by SDS-PAGE and Western blotting. 

Coimmunoprecipitation experiments were performed as described by Sittler et al. 18 
For immunofluorescence microscopy COS1 cells were grown on cover slips and 



cotransfected with pcDNA-HD510Q68 and pTL-HA-GIT1. 40 h post transection cells 
were fixed with 2% paraformaldehyde. Standard protocols for staining with 
appropriate primary and secondary antibodies were used 18 . 

• Filter Retardation Assay 

HEK293 cells coexpressing HD169Q68 and GIT1, PIASy, HIP5, HP28, PFN2, FEZ1 
or BARD1 were harvested 48 h post transfection. Cells were lysed as described 18 
and boiled in 2% SDS, 100 mM DTT for 5 min. Aliquots containing 50, 25 and 12.5 
ug of total protein were used for filtration on a cellulose acetate membrane 14 . SDS- 
resistant aggregates were detected using anti-CAG53b or anti-GIT1 antibodies. 

• Immunocytochemistry 

Mice were deeply anaesthetised and perfused through the left cardiac ventricle with 
4% paraformaldehyde in 0.1 M phosphate buffer. Brains were removed and postfixed 
overnight in 4% paraformaldehyde. Sections were processed for 
immunocytochemistry as described 47 . pAb EM48 (1:1000) and affinity purified anti- 
GIT1 pAb (1:100) were used as primary antibodies. 

Six human HD and 5 control brains were used in this study. Two HD cases were 
classified as grade 3 and four cases as grade 4 of neuropathological severity. For 
immunolabelling standard protocols were used 48 . 2B4 mAb (1:200) and affinity 
purified GIT1 pAb (1 :50) were used as primary antibodies. 

Example 2: Two-hybrid screens and data management 

To generate a PPI network for HD we used a combination of library and matrix yeast 
two-hybrid screens (Fig. 1a). First, 50 selected cDNAs encoding proteins potentially 
involved in HD including 10 different htt fragments were cloned into a DNA binding 
domain vector for expression of LexA fusion proteins (baits). The resulting plasmids 
were introduced into yeast strain L40ccua, which carries three reporter genes, HIS3, 
URA3 and lacZ, for two-hybrid interaction analyses. Forty baits did not activate the 
reporters by themselves and were used individually for cotransformation screening of 
a human fetal brain cDNA library expressing GAL4 activation domain hybrids (preys). 
In each screen, 1 x 10 8 auxotrophic transformants were tested on selective plates, 



and 1-50 positive colonies were typically obtained. Restriction analyses and 
sequencing identified preys that together with their respective baits repeatedly 
activated the reporter genes. Starting with 40 baits in the first round of 
^transformation screens we identified 34 PPIs for 10 baits (Table 1). 

In the second round of screens, 12 cDNA fragments encoding preys identified in the 
first screen were subcloned into a DNA binding domain vector. The resulting baits 
were tested for autoactivation and 10 were screened against a human fetal brain 
cDNA library. Four of the 10 proteins revealed additional 13 PPIs. 

Finally, an array mating screen was performed to connect all baits and preys 
.dentified in the transformation screens. For this assay, MATa yeast cultures were 
transformed with plasmids encoding prey proteins and arrayed in 96-well microtitre 
Plates for interaction mating with individual MATa strains expressing bait proteins 
Us,ng this strategy each bait was individually tested for interaction with every prey in 
the array. Diploid yeast Cones, formed by mating on YPD plates, were selected on 
agar SDH plates, and further transferred by a spotting robot on SDIV plates to select 
for Y2H interactions (Fig. 1b). We examined 3500 pairwise combinations of baits and 
preys in the mating assay and identified additional 70 PPIs. These interactions could 
be confirmed in cotransformation assays (Table 5). 

Table 5: 



Screen 


baits 


preys 


baits yielding 


interactions 


screened 


screened 


interactions 


identified 


1st transformation screen 


40 


4x1 0 7 


10 


34 


2nd transformation screen 


10 


1x10 7 


4 


13 


Array mating screen 


50 


70 


21 


70 



Thus, the combination of cDNA library and array mating screens preved power*,, in 
establishing a highly connected PPI network linked to htt. 



Sequence analysis of the cDNAs encoding bait and prey proteins revealed ORFs 
ranging from 82 to 728 amino acids in size (Table 2). In a systematic Blast search 60 
out of the 67 proteins identified were identical to a SwissProt or TrEMBL protein entry 
(http://us.expasy.org/sprot/). The remaining 7 proteins showed 75-99 % identity to its 
best fit and either contained single amino acid substitutions, variable polyQ lengths or 
small regions of sequence variation. Uncharacterised proteins were named according 
to their interaction partners. Each ORF was further examined for consensus protein 
domains using the FprintScan, HMMPfam, HMMSmart, ProfileScan, and 
BlastProDom programs providing useful hints to protein function. For example, the 
protein BAIP1 (BARD1 interacting protein 1) possesses a Zn-finger-like PHD finger 
that is believed to be important for chromatin-mediated transcriptional regulation. 
Similarly, domain searches for BAIP2 (BARD1 interacting protein 2) revealed a 
BTB/POZ domain, a motif found in developmentally regulated zinc finger proteins of 
the Kelch family of actin-associated proteins. Thus, BAIP2 could potentially mediate 
the association of BARD1 with the actin cytoskeleton. 

Example 3: Analysis and functional assignment of the two-hybrid data 

Our two-hybrid screens identified a total of 117 PPIs between 70 protein fragments. 
As a result of the iterative two-hybrid strategy all interactions could be depicted in a 
single large network. The number of interactions identified for each bait varied from 1 
to 18, with each protein having 1.6 interaction partners on average. In order to 
display the PPI data, both matrix and network representations were used (Fig. 2). 
The matrix shows, in addition to the two-hybrid interactions, previously reported 
interactions and interactions verified by independent methods (Fig. 2a). In 
comparison, the network view allows to immediately recognize local PPI patterns and 
paths connecting two proteins in the network (Fig. 2b). Interestingly, proteins such as 
htt, BARD1, GADD45G, HIP5, PIASy or VIM interact with more than 11 other 
proteins forming nodes within the HD network, while 30 proteins have only one 
interaction partner and thus are located at the periphery of the network (Fig. 2b). 
Indeed, all other proteins are embedded in many bi-fan motifs and multiple circular 
interaction clusters that have been interpreted to be an indication for biological 
relevance 11,19 . Schwikowski et al. 20 defined network proteins, which are separated 



by no more than two other proteins, as being part of a functional cluster. In this 
respect all proteins in our network form a functional cluster with htt. 

We assigned a subcellular localisation to each protein by examining various sources 
of literature and based on available experimental data we grouped the proteins into 
six broad functional categories (Fig. 2a, Table 2). 

Eighteen proteins in the HD network are involved in transcriptional regulation or DNA 
maintenance (Fig. 2a). The second largest group, 14 proteins, includes mainly 
cytoskeletal and transport proteins. We assigned 5 proteins to cellular signalling and 
fate, another 4 proteins to protein synthesis and turnover, and 3 proteins to cellular 
metabolism. Being part of 41 interactions, 16 proteins of unknown function were 
identified. 

For the analysis of htt PPIs, as much as 40 out of 1 17 interactions (34.2%) included a 
htt fragment (Fig. 2a). In total, 19 different htt interacting partners from various 
funct.onal groups were detected, 4 proteins had been previously described and 6 
involved proteins of unknown function. Surprisingly, most htt partners (6) are involved 
in transcriptional regulation and DNA maintenance, but others function in cell 
organization and transport (4), cellular signalling (2), or cellular metabolism (1) 
suggesting that htt functions in different subcellular processes. 

The current hypothesis that htt has a function in transcriptional regulation is inferred 
from its interactions with transcriptional activators, coactivators or repressors 21 In 
agreement with previous reports, binding of htt to CA150 22 and HYPA 23 has been 
detected in our screens. In addition, new connections to nuclear proteins such as 
SETBD1, FLIP and HBQ1 were found. These multidomain proteins act on histones 
and are known modulators of chromatin structure and gene expression. Similarly, the 
zinc finger bromo domain containing proteins BARD1, NAG4, HZFH. ZHX1 ZNF33B 
Play a role in transcriptional control. The protein IKAP directly interacts with htt and 
was recently shown to be part of a complex regulating RNA polymerase II activity 24 
Htt also interacts with PIASy, which inhibits transcription factor STAT-mediated gene 
activation 2 *. PlASy functions as SUMO E3 ligase for the Wnt-responsive 
transcription factor LEF1, inhibiting its activity via sumoylation « This suggests that 
PIASy catalysed sumoylation of transcription factors could represent a general 



mechanism in repression of gene expression. The binding of PIASy to htt indicates 
that htt may itself be a substrate for sumoylation. Alternatively, it could influence the 
sumoylation of other transcription factors. Thus, our data extend the nuclear role of 
htt and provide additional leads for its involvement in transcriptional regulation. 

Another large group of htt interactors identified here are proteins that function in 
cellular organization and vesicle transport. We report a new interaction between htt 
and dynein light chain (HP28), a component of the dynein/dynactin motor protein 
complex. Interestingly, the p150 Glued subunit of dynactin is linked to the htt-associated 
protein HAP1 16,27 . Our observation that htt directly binds to HP28 underscores the 
potential scaffolding role of htt/HAP1 in dynein/dynactin driven retrograde vesicle 
transport along microtubules in axons. 

The htt interacting protein HIP1 anchors clathrin-coated vesicles to the cytoskeleton 
via its actin-binding domain, a link crucial for synaptic vesicle endocytosis 28 . Here, a 
new PPI between htt and profilin II (PFN2) 29 was detected. PFN2, a protein enriched 
in neurons, modulates actin polymerization in vitro and is involved in endocytosis via 
association with scaffolding proteins 29 . The htt-PFN2 connection adds support to a 
potential role of htt in modulation of both actin polymerization and vesicle transport 
processes. 

Currently, for the function of 6 htt interactors, including HIP5, no genetic or 
biochemical evidence is available (Table 2). We found that HIPS binds to htt as well 
as to karyopherin a (KPNA2). KPNA2 serves as an adapter for karyopherin p 
(KPNB1), which transports NLS-tagged proteins into the nucleus 30 . Thus, HIP5 
might take this route to the nucleus. Interestingly, HEAT or armadillo (ARM) repeats, 
forming a-helical structures in KPNA2 and KPNB1 are also present in htt 31 . 
Therefore, the complexes between KPNA2 and HIPS as well as between htt and 
HIPS could be similar in terms of protein structure. It is tempting to further speculate 
that htt participates in nucleocytoplasmic transport. 

Example 3:Verification of PPis 

Comparison with literature-cited interactions revealed that more than 80% of the two- 
hybrid interactions identified here are novel. For all network bait and prey proteins 
only 24 PPIs have been reported previously using two-hybrid methods, 



cxjimmunoprecipitations or affinity chromatography-based techniques; 18 of these 
were confirmed in our Y2H assays (Fig 2a, Table 2). Failure to detect interactions 
may result from the high stringency of our particular two-hybrid system. However, in 
most cases the occurrence of false negatives can be explained by the lack of 
essential domains in one of the protein fragments used. For example, an interaction 
between p53 and hADA3 has been described 32 , with the first 214 amino acids of 
hADA3 being essential for this interaction. It escaped our two-hybrid analysis, 
because a C-terminal hADA3 fragment (amino acids 235-432) was used. For the 
same reason, an interaction between p53 and BARD1 or between KPNA2 and 
. KPNB1 was not observed. 

Beside false negatives, the two-hybrid assay is also prone to create false positive 
results 9 . Addressing this issue we performed a series of pull-down and overlay 
assays and thereby confirmed several of the two-hybrid PPIs independently. Proteins 
were expressed as GST-fusions in E. coli and as HA-fusions in COS1 cells. After 
immobilization of the GST-fusion protein to beads or nitrocellulose membranes the 
respective partner was affinity-purified from a COS1 cell extract and binding was 
detected by immunoblotting. Using these assays, 22 physical interactions, central to 
the HD network, were verified (Fig. 2a). The results of some in vitro GST pull-down 
assays are shown in Fig. 3. For example HD510Q17 interacts with HIP1, GIT1. 
PIASy, FEZ1 and HIP11, and HIP5 binds to HD510Q68, GIT1, HB01, PLIP and 
FE21 (Fig. 3). In total, 35 two-hybrid interactions were verified independently either in 
previous studies or by our in vitro binding assays (Fig. 2a). 

Example 4: GIT1 promotes htt aggregation in vivo 

The formation of insoluble polyQ-containing protein aggregates is a pathological 
hallmark of HD. Several lines of evidence link htt aggregation to disease progression 
and the development of motor symptoms. We screened network proteins for their 
potential to enhance htt aggregation in a cell-based aggregation assay M . In this 
assay, formation of SDS-insoluble htt aggregates in mammalian cells, that have been 
cotransfected with constructs encoding an N-terminal htt fragment with 68 glutamines 
(HD169Q68) and a network protein of interest, is monitored by filter retardation 14 . 
HD169Q68 perse has only a low propensity to form insoluble aggregates in HEK293 
cells. However, as shown in Fig. 4a coexpression of the htt-interacting protein GIT1 



strongly promotes the formation of HD169Q68 aggregates, whereas coexpression of 
PIASy, HIP5, HP28, PFN2, FEZ1 and BARD1 has no discernable effect. Thus, GIT1 
is a potential modifier of HD pathogenesis, which may influence the rate of formation 
of insoluble htt aggregates in vivo. 

Furthermore, probing of the insoluble HD169Q68 aggregates with an anti-GIT1 
antibody revealed that GIT1 does not only stimulate aggregation but is also an 
integral part of the insoluble aggregates (Fig. 4a). This suggests that GIT1 promotes 
aggregation through direct binding to mutant htt. 

The interaction between GIT1 and htt was confirmed by coimmunoprecipitation from 
COS1 cells transfected with constructs encoding HD510Q68 and HA-GIT1. Forty 
hours post transfection cell extracts were prepared and treated with antiserum 
against GIT1. HD510Q68 and HA-GIT1 were detected in the immunoprecipitate on 
Western blots with anti-htt antibody 4C8 and anti-HA antibody 12CA5, respectively 
(Fig. 4b). 

The GIT1-htt interaction was also detected in human brain. Protein extracts prepared 
from human cortex were treated with the anti-htt antibodies CAG53b and HD1, and 
the precipitate was probed for the presence of GIT1 (Fig. 4c). Full length GIT1, 
migrating at about 90 kDa 33 , was precipitated by both anti-htt antibodies in a 
concentration dependent manner, indicating the existence of a complex between htt 
and GIT1 in neurons. 

Finally, we performed colocalisation studies of htt and GIT1 in COS1 cells using 
immunofluorescence microscopy. In cells expressing HD510Q68 or GIT1 alone a 
diffuse cytoplasmic staining was observed for each protein (Fig. 4d). However, when 
GIT1 and mutant htt were coexpressed, large perinuclear structures, most likely 
reflecting protein aggregates, appeared almost exclusively. These structures 
contained both GIT1 and htt. The images further substantiate the findings that GIT1 
and htt bind to each other and that GIT1 is a potent enhancer of mutant htt 
aggregation. 

Example 5: GIT1 localises to htt aggregates in HD transgenic mouse and 
patient brains 



The finding of colocalisation of htt and GIT1 within aggregates in transfected COS1 
cells suggests that GIT1 might also be a component of htt aggregates in vivo. To 
investigate this possibility we first assessed the distribution of GIT1 in brains of R6/1 
transgenic mice expressing a human htt exon 1 protein with 150 glutamines **. In 
wildtype mice, GIT1 immunoreaction product was found diffuse in the cytoplasm and 
nuclei of neurons throughout the brain. In R6/1 brains, in addition to the diffuse 
staining, GIT1 immunoreactivity was also present in large nuclear and cytoplasmic 
puncta similar to htt aggregates (Figure 5a). To further confirm these data, we 
examined the subcellular distribution of GIT1 in cortex from HD patient brains and 
healthy individuals (Fig. 5b). In patient brains, GIT1 antibodies labelled neuronal 
nuclear inclusions as well as neuropil aggregates characteristic of HD brains * In 
contrast, neurons from control brains only showed a diffuse nuclear and cytoplasmic 
GIT1 immunostaining. In fact, in colocalisation studies performed in HD brain 
sections. GIT1 positive aggregates were also labelled with anti-htt antibody 2B4, 
indicating that both proteins coaggregated in vivo (Fig. 5c). This observation raises 
the possibility that an alteration of the neuronal GIT1 subcellular distribution 
contributes to HD pathogenesis. 
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Claims 

1 . A method for generating a network of direct and indirect interaction partners of a 
disease-related (poly)peptide comprising the steps of 

(a) contacting a selection of (poly)peptides suspected to contain one or several 
of sa,d direct or indirect interaction partners with said disease-related 
(polypeptides and optionally with known direct or indirect interaction 
partners of said disease-related (polypeptide under conditions that allow 
the mteraction between interaction partners to occur 

(b) detecting (polypeptides that interact with said disease-related 
(poly)peptide or with said known direct or indirect interaction partners of 
said disease-related (polypeptide; 

(c) contacting (polypeptides detected in step (b) with a selection of 
(polypeptides suspected to contain one or several (polypeptides 
interacting with said (polypeptides detected in step (b) under conditions 
that allow the interaction between interaction partners to occur 

(d) detecting proteins that interact with said (polypeptides detected in step (b)- 

(e) contacting said disease-related (polypeptide and optionally said known 
d.rect or indirect interaction partners of said disease-related (polypeptide 
said (polypeptides detected in steps (b) and (d) and a selection of proteins 
suspected to contain one or several (polypeptides interacting with any of 
the afore mentioned (polypeptides under conditions that allow the 
interaction between interaction partners to occur; 

(f) detecting (polypeptides that interact with said disease-related 
(polypeptide and optionally said known direct or indirect interaction 
partners of said disease-related (polypeptide or with said (polypeptides 
identified in step (b) or (d); and 

(9) generating a (poly)peptide-(poly) P eptide interaction network of said 
disease-related (polypeptide and optionally said known direct or indirect 
interaction partners of said disease-related (polypeptide and said 
(polypeptides identified in steps (b), (d) and (f). 

The method of claim 1. wherein said contacting step (e) is effected in an 
interaction mating two hybrid approach. 



3. The method of claim 1 or 2, said method comprising after step (d) and before step 
(e) the steps of: 

(d 1 ) contacting (poly)peptides detected in step (d) with a selection of 
(polypeptides suspected to contain one or several (poly)peptides interacting 
with said (polypeptides detected in step (d) under conditions that allow the 
interaction between interaction partners to occur; and 

(d") detecting proteins that interact with said (polypeptides detected in step (d'). 

4. The method of any one of claims 1 to 3, wherein said disease-related protein is a 
protein suspected of being a causative agent of a hereditary disease. 

5. The method of any one of claims 1 to 4, wherein said disease-related protein is 
huntingtin and wherein said interaction partners are the interaction partners as 
shown in tables 1 and 2. 

6. The method of any one of claims 1 to 5, said method comprising the step of 
determining the nucleotide sequence of a nucleic acid molecule encoding a direct 
or indirect interaction partner of the disease related protein. 

7. The method of any one of claims 1 to 6. wherein said selections of proteins are 
translated from a nucleic acid library. 

8. The method of any one of claims 1 to 7, wherein said selection of proteins in step 
(a) and/or (c) and/or (d') and/or (e) is the same selection or a selection from the 
same source. 

9. The method of any one of claims 1 to 7, wherein said selection of proteins in step 
(a) and/or (c) and/or (d 1 ) and/or (e) is a different selection or a selection from a 
different source. 



The method of any one of claims 1 to 9, wherein said method is performed by 
contacting the proteins on an array. 



11. The method of any one of claims 1 to 10, wherein said interactions are detected 
by using the yeast two-hybrid system. 

12. The method of any one of claims 1 to 11, containing after step (b), (d), (d") or (f) 
the additional steps of isolating a nucleic acid molecule with homology to said 
cDNA expressing the encoded protein and testing it for its activity as a modulator 
of huntingtin, wherein said nucleic acid molecule is DNA, or RNA, preferably 
cDNA, or genomic or synthetic DNA or mRNA. 

13. A nucleic acid molecule encoding a modulator of huntingtin, wherein said 
modulator is a protein selected from table 3. 

14. The nucleic acid molecule of claim 13, wherein said nucleic acid molecule is DNA 
preferably cDNA, genomic DNA, or synthetic DNA or RNA, preferably mRNA. 

15. The nucleic acid molecule of claim 13 or 14 fused to a heterologous nucleic acid 
molecule. 

16. The nucleic acid molecule of claim 15, wherein the heterologous nucleic acid 
molecule encodes a heterologous (polypeptide. 

17. A vector comprising the nucleic acid molecule of any one of claims 1 3 to 16. 

18. A host cell containing the nucleic acid molecule of any one of claims 13 to 16 or 
the vector of claim 17. 

19. A method of producing a (polypeptide, comprtsing culturing the host cell of claim 
18 under conditions such that the (polypeptide encoded by said polynucleotide is 
expressed and recovering said (polypeptide. 

20. A (poly)peptide comprising an amino acid sequence encoded by a nucleic acid 
molecule of any one of claims 13 to 16, or which is chemicaily synthesized, or is 
obtainable from the host cell of claim 18. or which is obtainable by the method of 
claim 18. 



21. The (poly)peptide of claim 20 fused to a heterologous (poly)peptide. 



22. A protein complex comprising at least two proteins, wherein the two proteins are 
selected from the group of interaction partners listed in table 4. 

23. An antibody specifically recognizing the (polypeptide of claim 20 or 21 or 
specifically reacting with the protein complex of claim 22. 

24. The antibody of claim 23 which is polyclonal, monoclonal, chimeric, single chain, 
single chain Fv, human antibody, humanized antibody, or Fab fragment. 

25. A method of identifying whether a protein promotes huntingtin aggregation, 
comprising 

(a) transfecting a first cell with a nucleic acid molecule encoding a variant of the 
huntingtin protein or a fragment thereof capable of forming huntingtin 
aggregates; 

(b) co-transfecting a second cell with 

(i.) a nucleic acid molecule encoding a variant of the huntingtin protein 
or a fragment thereof capable of forming huntingtin aggregates; and 

(ii.) a nucleic acid molecule encoding a candidate modulator protein 
identified by the methods of any one of claims 1 to 12 or a nucleic 
acid molecule encoding a modulator protein selected from table 1 or 
table 2; 

(c) expressing the proteins encoded by the transfected nucleic acid molecule of 
(a) and (b); 

(d) isolating insoluble aggregates of huntingtin from the transfected cell of (a) 
and (b); and 

(e) determining the amount of insoluble huntingtin aggregates from the 
transfected cell of (a) and (b) 

wherein an increased amount of huntingtin aggregates isolated from the 
transfected cells of (b) in comparison with the amount of huntingtin aggregates 
isolated from the transfected cells of (a) is indicative of a protein's activity as an 
enhancer of huntingtin aggregation. 



6 5 



26.A method of identifying whether a protein inhibits huntingtin aggregation, 
comprising 

(a) transfecting a first cell with a nucleic acid molecule encoding a variant of the 
huntingtin protein or a fragment thereof capable of forming huntingtin 
aggregates; 

(b) co-transfecting a second cell with 

(i.) a nucleic acid molecule encoding a variant of the huntingtin 
protein or a fragment thereof capable of forming huntingtin 
aggregates ; and 

(ii.) a nucleic acid molecule encoding a candidate modulator protein 
identified by the methods of any one of claims 1 to 12 or a 
nucleic acid molecule encoding a modulator protein selected 
from table 1 or table 2; 

(c) expressing the proteins encoded by the transfected nucleic acid molecule 
of (a) and (b); 

(d) isolating insoluble aggregates of huntingtin from the transfected cell of (a) 
and (b); and 

(e) determining the amount of insoluble huntingtin aggregates from the 
transfected cell of (a) and (b) 

wherein a reduced amount of huntingtin aggregates isolated from the 
transfected cells of (b) in comparison with the amount of huntingtin aggregates 
isolated from the transfected cells of (a) is indicative of a protein's activity as 
an inhibitor of huntingtin aggregation. 



27. 



28. 



29. 



The method of claim 25 or 26, wherein prior to step (d) the cells are treated 
with an ionic detergent. 

The method of any one of claims 25 to 27, wherein the huntingtin aggregates 
are filtered onto a membrane. 

A method for identifying compounds affecting an interaction of huntingtin or of 
a direct or indirect interaction partner of huntingtin comprising 
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(a) contacting interacting proteins selected from the group of interacting 
proteins listed in table 1 and/or table 2 in the presence or absence of an 
potential modular of interaction; 

(b) identifying compounds capable of modulating said interaction. 

30. The method of any one of claims 25 to 29 , further comprising 

(a) modeling said compound by peptidomentics and 

(b) chemically synthesizing the modeled compound. 

31. The method of any one of claims 25 to 30, wherein said compound is further 
modified to achieve 

(i) modified site of action, spectrum of activity, organ specificity, and/or 

(ii) improved potency, and/or 

(iii) decreased toxicity (improved therapeutic index), and/or 

(iv) decreased side effects, and/or 

(v) modified onset of therapeutic action, duration of effect, and/or 

(vi) modified pharmakinetic parameters (resorption, distribution, metabolism 
and excretion), and/or 

(vii) modified physico-chemical parameters (solubility, hygroscopicity, color, 
taste, odor, stability, state), and/or 

(viii) improved general specificity, organ/tissue specificity, and/or 

(ix) optimized application form and route 
by 

(i) esterification of carboxyl groups, or 

(ii) esterification of hydroxy! groups with carbon acids, or 

(iii) esterification of hydroxyl groups to, e.g. phosphates, pyrophosphates or 
sulfates or hemi succinates, or 

(iv) formation of pharmaceutical^ acceptable salts, or 

(v) formation of pharmaceutically acceptable complexes, or 

(vi) synthesis of pharmacologically active polymers, or 

(vii) introduction of hydrophilic moieties, or 

(viii) introduction/exchange of substituents on aromates or side chains, 
change of substituent pattern, or 

(ix) modification by introduction of isosteric or bioisosteric moieties, or 
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(x) synthesis of homologous compounds, or 

(xi) introduction of branched side chains, or 

(xii) conversion of alkyl substituents to cyclic analogues, or 

(xiii) derivatisation of hydroxy! group to ketales, acetates, or 

(xiv) N-acetylation to amides, phenylcarbamates, or 

(xv) synthesis of Mannich bases, imines, or 

(xvi) transformation of ketones or aldehydes to Schiffs bases, oximes, 
acetates, ketales, enolesters, oxazolidines, thiozolidines or 
combinations thereof. 

A method of diagnosing Huntington's disease in a biological sample 
comprising the steps of 

(a) contacting the sample with an antibody specific for a protein of table 1 
or 2 or an antibody specific for the protein complex of claim 22; and 

(b) detecting binding of the antibody to a protein complex, 

wherein the detection of binding is indicative of Huntington's disease or of a 
predisposition to develop Huntington's disease. 

33. The method of claim 32, wherein 

(a) said protein complex contains GIT1 or 

(b) said antibody is specific for a protein complex containing GIT1. 



32. 



34. 



35. 



A diagnostic agent/composition or pharmaceutical composition comprising the 
nucleic acid molecule of any one of claims 13 to 16, the (polypeptide of claim 
20 or 21 or the (polypeptide mentioned in anyone of tables 1 and 2 the 
antibody of claim 23 or 24, an antibody specifically reacting with a protein 
selected from table 2 and/or a protein selected from table 2. 

Use of the molecule of any one of claims 13 to 16, the (polypeptide of claim 
20 or 21 or the (polypeptide mentioned in anyone of tables 1 and 2 the 
antibody of claim 23 or 24, an antibody specifically reacting with a protein 
selected from table 2 and/or a protein selected from table 2, for the 
preparation of a pharmaceutical composition for the treatment of Huntington's 
disease. 
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The present invention relates to a method for generating a network of direct and 
indirect interaction partners of a disease-related (polypeptide comprising the steps of 
(a) contacting a selection of (poly)peptides suspected to contain one or several of 
said direct or indirect interaction partners with said disease-related (poly)peptides 
and optionally with known direct or indirect interaction partners of said disease- 
related (poly)peptide under conditions that allow the interaction between interaction 
partners to occur; (b) detecting (poly)peptides that interact with said disease-related 
(poly)peptide or with said known direct or indirect interaction partners of said 
disease-related (poly)peptide;(c) contacting (poly)peptides detected in step (b) with a 
selection of (polypeptides suspected to contain one or several (polypeptides 
interacting with said (polypeptides detected in step (b) under conditions that allow 
the interaction between interaction partners to occur; (d) detecting proteins that 
interact with said (poly)peptides detected in step (b); (e) contacting said disease- 
related (polypeptide and optionally said known direct or indirect interaction partners 
of said disease-related (polypeptide, said (polypeptides detected in steps (b) and 
(d) and a selection of proteins suspected to contain one or several (polypeptides 
interacting with any of the afore mentioned (poly)peptides under conditions that allow 
the interaction between interaction partners to occur; (f) detecting (poly)peptides that 
interact with said disease-related (polypeptide and optionally said known direct or 
indirect interaction partners of said disease-related (poly)peptide or with said 
(polypeptides identified in step (b) or (d); and (g) generating a (polypeptide- 
polypeptide interaction network of said disease-related (poly)peptide and optionally 
said known direct or indirect interaction partners of said disease-related (poly)peptide 
and said (poly)peptides identified in steps (b), (d) and (f). Moreover, the present 
invention relates to a protein complex comprising at least two proteins and to 
methods for identifying compounds interfering with an interaction of said proteins. 
Finally, the present invention relates to a pharmaceutical composition and to the use 
of compounds identified by the present invention for the preparation of a 
pharmaceutical composition for the treatment of Huntington's disease. 
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>ALEX2 



liQQGCPVRLCRVC '^SPBKBKMNPLEQYERKVNA.SVPQVSLSTHRRPRGMSWHQ 
>HAIP1 

LKKGRGQKERRGSRGRGRRTSEE^SJ^ 
>BAIP2 



MV^CASLKRSDFQE^CVFNDEIY^S^^^ 



>BARD1 




Figure 6 
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>CGI-125 

FDASAROTARVSGIXLCQAGG^ 

QRGYPKDKAFVNYLKYLLyWKDPEYAKYLKYPQCLHMLELLQY^ 

WQHYSRKRMRLQQALAEQQQQNNTSGK 

>CGI-74 

VE KARAKKREAEEVYRltf SMPAS S FQQQKLR VCEVC S A YLGLHDNDRRLADHFGGKLHLGP I E I REK 

LEELKRWAEKQEKRNQERIiKKREEREREEREKLRRSRSHSKNPKR 

>CLH-17 

MAQILPIRFQEHLQLQNIiGINPANIGFSTLT^ 

SADSAIMNPASKVI ALKAGKTLQIFNI EMKSKMKMTiVrroDVTFWKWI SLNTVALVTDNAVVHWSM 
EGESQPVKMFDRHSSLAGCQI INYRTDAKQKWIiLIiTGI SAQQNRWGAMQLYSVDRKVSQPIEGHA 
ASPAQFKMEGNAEESTLFCFAVRGQAGGKIiHI I EVGTPPTGNQPFPKKAVDVFFPPEAQNDFPVAM 
QI SEKHDWFLITKYGYIHLYDLET 
>CLK1 

DAWVLEHIiNTTDPNS TFRCVQMLEWFEHHGHI C I VFELIiGLSTYDF I KENGFLPFRLDHI RKMAYQ 

I CKSVNFLHSNKLTHTDLKPENI LFVQSD YTEAYNPKI KRDERTL I NPD I KWDFGSAT YDDEHHS 

TLVSTRHYRAPEVIIiALGWSQPCDVWSIGCIM 

QKTRKRKYFHHDRIJDWDEHSSAGRYVSRRCKPLKE^ 

EALKHPFFDLIjKKS I 

>DRP-1 

KDNPTL I PEGTVNGI EERMTVVWDKAVATGKMDENQFVAVTSTNAAKI FNIiYPRKGRI AVGSDADW 
IWDPDKLKTITAKSHKSAVEYNIFEGMECHGS PLWI SQGKI VFEDGNINVNKGMGRFI PRKAFPE 
HLYQRVKIRNKVFGLQGVSRGMYDGPVYEn^PATPKYATPAPSl^KS S P SKHQPPPIRNIjHQSNFSIiS 
GAQIDDNNPRRTGHRIVAPPGGRSNITSLG 
>EF1A 

MHHEALSEALPGDNVGFNVKNVS VKDVRRGNVAGDS KNDPPMEAAGFTAQVI I UflHPGQI S AGYAP 

VLDCHTAHIACKFAELKEKIDRRSGKKLEDGPKTL 

VRDMRQTVAVGVI KAVDKKAAGAGKVTKS AQKAQKAK 

>EFlG(bait) 

AAGTLYTYPENWRAFKALIAAQYSGAQVRVLSAPPHra 
CVFESJSLklAYYVSNEEI^GST^ 

EVRRI LGLliDAYLiKTRTFIiVGERVTLADI TVVCTLLWIiYKQVIfEPS FRQAFPNTNRWFI/TC INQPQ 

FRAVLGEVKLCEKMAQFDAKKFAETQPKK^ 

CEQAIiAAEPKAKDPFAHLPKSTFVLDEFKRKYSNEDTIj 

LTQTFMSCJ^I TGMFQRIiDKLRKNAFASVILFGTNNS SSI SGVWVFRGQEIAFPLS PDWQVDYE S Y 

TWRKLDPGSEETQTLVREYFSPJEGAFQHVGKAFNQGKIFK 

>EF1G (prey) 

AAGTLYTYPEKTCroAFKALIAAQYSGA 

CVFESNAI AYYVSNEELRGSTPEAAAQVVQWVS FADSD I VPPASTWVFPTI^IMHHNKQATENAKE 
EVRRI LGLLD AYLKTRTFLVGERVTIiAD I TWCTLLWIj YKQVLEPS FRQAFPNTNRWFLTC INQPQ 
FRAVLGEVKLCEKMAQFDAKKFAETQPKKDTPRKEKGSREEKQKPQAERKEE 
CEQAIAAEPKAKDPFAHLPKSTFVI43EFKRKYSNEDTL,S 

LTQTFMS CNIiI TGMFQRIiDKLRKNAFASVILFGTNWSS S I SGVWVFRGQELAFPLSPDWQVDYES Y 

TWRKLDPGSEETQTLVREYFSWEGAFQHVGKAFNQGKIFK 

>FEZ1 

GNC SDTE I HE KEEEEFNE KSENDSG INE E PIjIjTADQVI EEI EEMMQNB PD PEEEEEVLEEEDGGET 
SSQADSVIaLQEMQALTQTFNNNWSYEGIiR^ 

BKE VKNS FI TVIiI BVQNKQKEQREIaMKKRRKEKGLSIjQSSRIEKGNQMPLKRFSiVlEGI SNI L>QSG I 
RQTFGSSGTDKQYI^NTVI P YEKKAS PPSVEDLQMI/TNI IiFAMKEDNEKVPTLIiTDYIIiKVLCPT 



Figure 6 (continued) 
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>G45IP1 

massggelgslfd: 



^K^KEVISSVPKI.PBoSS?SSSkSS^^ IMIOPLLPDISKTOMHODSLNTTA 
KDVDALLKKSEAQHEQPEDGCPFGAL' 



Figure 6 (continued) 
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>HD1.7 

MATLEKLMKAFESLKSFQQQQQQQQQQQQQQQQQQQQQQQpppppppppppQLpQppp^Qp^pQ 

PQPPPPPPPPPPGPAVABBPI^PKKEr,SATKKDRVNHCLTICENIVAQSVRNSPBFQKI^I^ 
LFLLC SDDAE SDVRMVADECLNKVI KALMDSOTjPRIjQIjEIjYKE I KKNGAPRSLRAALWRFAELAHL 
VRPQKC^YLVNLLPCLTRTSKRPEESV^^^ 

S PT I RRTAAGS AVS ICQHSRRTQ YFY S WLLNVIjLGLLVPVEDEHSTIili I LGVLLTLRYL VPLLQQQ 
VKDTSLKGSFGVTRKEMEVSPSAEQLVQVYELTLHHTQHQDHNVVTG^El^ 

LTAVGGIGQLTAAKEESGGRSRSGSIVELIAGGGSSCSPVLSRKQKGKVIjLGEEEALEDDSESRSD 

VSS SAI4TASVKDEI SGELAASSGVSTPGSAGHDI I TEQPRSQHTAGGLSGSGQL 

>HDdl.O 

MATLEKLMKAFESLKSFQQQQQQQQQQQQQQQQQQQQQQQpppppppppppQ^pQpppQj^Qp^pQ 

PQPPPPPPPPPPGPAVAEEPLHRPKKELSATKKDRVNHCLTICENIVAQSVRNSPEFQKIjIjGI 
LFI^CSDDAESDVRMVADECLNOT^ 

VRPQKCRP YLVNLLPCLTRTS KRPEE S VQETLAAAVPKIMASFGNFANDNE IKVLLKAF I AMLKS S 
SPT IRRTAAGSAVS I CQHSRRTQYFYSWLLJTVIJjGLLVPVEDEHSTIiLILGVIjIiTLSR 
>HDdl.3 

PRLQLELYKE I KKNGAPRSLRAALWRFAELAHLVRPQKCRPYLVNLLP 

AAVPKIMAS FGNFANDNE I KVLLKAF I ANLKS S S PT I RRTAAGS AVS I CQHSRRTQYFYSWLLNVT* 
LGLIjVPVEDEHSTLL I LGVLLTLRYL VPIjLQQQVKDT SIiKGS FGT\rrRKEMEVS P SAEQLVQ VYELtT 
LHHTQHQDHNVVTGALELIjQQIjFRTPPPELIIjQTLiTAVGGI GQLTAAKEESGGRSRSGS I VELI AGG 
GSSCSPVLSRKQKGKVLIiGEEEALEDDSESRSDVS SSALTASVKDE I SGELAAS SGVSTPGSAGHD 
I ITEQPRSQHTAGGLSGSGQL 
>HDexQ20 

MATIjEKNIMKAFESljKSFQQQQQQQQQQQQQQQQQQQQpppppppppppppQLpQpppQ^QpLLpQp 

QPPPPPPPPPPGPAVAEEPLHRP 

>HDexQ51 

MATLEKIiMKAFESIjKSFQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQqqqqqqqqqqqqqqqqq 

QQPPPPPPPPPPPQLPQPPPQAQPIiIiPQPQPPPPPPPPPPGPAVTxRSRCTDREST 
>HIP1 

ADTLQGHRDRFMEQFTKLKDIjFYRS SNIiQYFKRL I Q I PQL PENP FNFLRAS ALS EHI SPVWI PAE 
ASS PDSEPVIiEKDDLMDMDAS 

I SGLKAQLENMKTE SQRWLQLKGHVS ELEADLAEQQHLRQQAADDCEFLRAELDELiRRQREDTE K 
AQRSLS E I ERKAQANEQRYS KLKBKYSEIjVQNHADLLRKNAEVTKQVSMARQAQVDIjEREKKELED 
Sl^RISDQGQRXTQEQLEVLESLKQELATSQRELQ\^ 

SGAAHREEELSAIiRKELQDTQLKLASTEESMCQIAKDQRKMLLVG 
>HIP11 

VDLiVTACD I RYCAQDAFFQVKEVDVGLiAAD VGTLQRLPKV I GNQSDVNEIiAFTARKMMADEAIjGS G 
LVSRVFPDKEVMLDAALAIAAE ISBKB PVAVQSTKVNLIjYSRDHS VAESIjNYVASVTITOSMLQTQDIj 
VKSVQATTENKELKTVTFSKL 
>HIP13 

PCCSEDTIPSQVSDYDYFSVSGDQEADQQEFDKSSTlPRNSDISQSYRRWFQAKRPASTAGLPTTIi 
GPAMVTPGVATIRRTPSTKPSVRRGTIGAGPIPIOTPVIPVKTPTVPDLPGV^ 

SPESPSVGEGPQGVTSMPS5MWSGQASVNPPLPGPKPSIPEEHRQAIPESEAEDQEREPPSATVSP 

GQIPESDPADLSPRDTPQGEDMLNAIRRGVKXjK3CTTTNDRSAPRFS 

>HIP15 

I^MAPPYPSTL^IETFICQV^ 

TAHARTKFHVLKMLI^SENPAVAKKLFSAKALS IPVGLFNIEETNDNIQIVIKMFQNISNIIKSG 
KMSLIDDDFSLEPIiISAFREFEELAKQLQAQIDNQNDPEVGQQS 



Figure 6 (continued) 
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>HIP16 

POSSESS 




LLEKRECRTSSCRDKR -1- <- TL S AE EQKI IJ3 S tiNDLNERmy I QE S I CKNP S IKNTLOI I P 

>HIP5 (parey) 




^SKHVLPTEHSIWQWNQESS^^ 
^^^R^QKATBkqhJ^S^I^^ 




ILTAYASAVGIGTTQVQPE YCI ^GDLEItAAKFVNQLKQESRRVAQDWLKEAI^TliETKQIVE 



>KP28 

PP^SI.LKYDTPVLVSRNTEKRSPKARI.r.KVSPQQPGp 




Figure 6 (continued) 
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>HYPA 

GRRRSSLSPTMRPGTGAERGGLMMGHPGMHYAPMG^PMGQRANMPPVPHGMMPQMM&PMGGPPMG 

QMPGI^SSVMPGMMMSHMSQASMQPAllPPGVNSMDVAAGTASGAKS^WTEHKSPIX3RTYYYNTETK 

QSTWEKPDDLKTPAEQLLSKCPWKEYKSDSGKPYyYNSQTKESRWAKPKELEDLEGYQNTIVAGSL 
ITKSNLHAMIKAEESSKQEECTTTSTAPVPTTEIPTTMSTM^ 

NASTSASNTVSGTVPVVPEPBVTSIVATVVDNEIJTVTISTEEQAQLTSTPAIQDQSVEVS9NTGEE 

TSKQETVADFTPKKEBEESQPAKKTYTWNTKEEAKQAFra 

SAIiAKLSEKKQAFNAYKVQRJCKKEKKKKKKK 

>HZPH 

HARFAEAECIJ^SHQHLSKESIJUSNKPANAVIJIKVLNQLEE 

l^MSERSILSRIASKGTEPHPTPAYPPGPYATPPGYGAAFSAAPVGALAAAGANYSQMPAGSPTTA 

ATNGPPVLVKKEKEMVGALVSDGLDRKEPRAGEVICIDD 

>IKAP 

LKEGS PLEDliAliLEALSEWQNTENLKDEVYHI LKVLFLFEFDEQGRELQKAFEDTLQLMERS LPE 

IWTLTYQQNSATPVLGPNSTANSIMASYQQQKTSVPVLDAELFIPPKINRRTQWKLSLLD 
>IMPD2 

DFLILPGYIDFTADQVDLTSALTKKITLKTPLVSSPMDTVTEAGMAIAMALTGGIGPIHHNCTPEP 
QANEVRKVKKYEQGFI TDPWL SPKDRVRDVPEAKARHGFCG I P ITDTGRMG S RLVG 1 1 S SRD IDF 

LKEEEHDCFLEEIMTKREDLWAPAGITIJCEANEILQRSKKGKLPIVNEDDELVAIIARTDLKKim 

DYPIAS^AKKQLLCGAAIGTHEDDKYRLDLLAQAGVDVVVLDSSQGNSIFQINMIKYIKDKYPNL 

QVIGGNVVTAAQAKNLIDAGVDALRVGMGSGSICITQE\nLACGRPQATAVYKVSEYAPJlPGVPVlA 

DGGlQIWGHIAKAIALGASTVMMGSLLAATTEAPGEYFFSDGIRLKKYRGMGSIiM 

YFSEADKIKVAQGVSGAVQDKGSIHKFVPYLIAGIQHSCQDIGAKSLTQVRAMMYSGBLKPEKRTS 
S AQ VEGGVHSLHS YE KRXj F 

>KPNA2 

AWALTNI ASGTSEQTKAVVDGGAI PAF I SLIjAS PHAHI SEQAVWALGNT AGDGSVFRDLVI KYGAV 
DPLI^l^VPDMSSLACGYLRN^^ 

WAISYIiTDGPNERIGMVVKTGVVPQLVKLLGASELPIVTPTULiRA 
VPPSLLTNPKTNIQKEATWTMSNIT^ 

T5GGTVEQIVYLVHCGI IEPLMNIiLTAKDTKI ILVILDAISNIFQAABKLGETEKLSIMIEECGGl* 

DKIEALQNHENESVYKASLSLIEKYPSVEEEEDQNVVPETTSEGYTPQVQDGAPGTPNP 
>KPNB1 

IAAVGIiVGDLCRALQSNI I PFCDEVMQIjIiIjENIjGNENVHRS VKPQILSVFGDI A1AI GGEFKKYLE 
VVLNTLQQASQAQVDKSDYDMVDYIiNELR^ 

P I DHI AGDEDHTDGWACAAGIj I GDLiCTAFGKDVIiKLVEARPMI HELLTEGRRSKTNKAKTIjATWA 

TKELRKLKNQA 

>KU70 

KTRTFNTSTGGLLLP SDTKRSQI YGS RQ 1 1 LEKE ETEE LKRFDDPGIiMIjMGFKPLVLIjKKHHYIjRP 

SLFVYPEE SliVIGS STLFSALLI KCLEKEVAAIiCRYTPRRNl PP YFVAIjVPQEEEIjDDQKI QVTPP 

GPQLVPLPFADDKRKMPFTEKIMATPEQVGKMKAIVEKIiRFT^ 

LMEPEOAVDLTLPK^mAMJJKRLGSLVDEFKEL^ 

ELKTHlSKGTI^KFWPlvn^KEACRA^ 

>LUC7B1 

VDAVAVDAAAVS AKAEKVHELNEKI GKLIAKAEQLGAEGNVDE S QKI IJ4E VEKVRAKKKEAEEEYR 
NSMPASS FQQQKLRVCEVCSAYLGIiHDNDRRI^ 

QDRLRRREEREREERLSRRSGSRTRDRRRSRSRDRRRRRSRSTSRERRKLSRSRSRDRHRRHRSRS 
RSHSRGHRRASRDRSAKYKPSRERASREESWE5GRSERGPPDWRLESSNGKMASRRSEEKEAGEI 



Figure 6 (continued) 
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>MAGEH1 

====== 

>mp53 
>NEFL» 




2ESEEA 

>p53 ^^-^wtt&v$*W*X\JU\JJ 



Figure 6 (continued) 
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>PFN2 

APRRPRCSAKGSKMAGWQSYVDNLMCDGCCQEAAIVOYCDAKyvWAATAGGVFQSITPIBIDMlVG 

KDREGFPTNGLTLGAKKCSVIRDSLYVDGDCTMDIRTKSQGGEPTYNVAVGRAQRVLVPVMGKEGV 
HGGGLNKKAYSMAKyLRDSGF vi^i^^v 

>PlASy (bait) 

LVEAKNMVMSFRVSDIjQMLLGFVGRSKSGIjKHEXjVTRA 

SEPAPQPHRPLDPLTMHSTYDRAQAVPRTPIAGPNIDYPVLYGKYI^GI^RI^PAKTLKPEVRLVKI. 
PFFNMLDELLKPTELVPQNNEKLQESPCIPALTPRQVELIRNSRELQPGVKAVQVVLRICYSDTSC 
PQE^YPPNIAVKimHSYCSVPGYYPSNKPGVEPKRPCRPI^^ 

YS VAL YLVRQLTS S ELLQRLKT I GVKHPELCKALVKEKLRLDPDS E IATTGVRVSLI CPLVKMRL S 
VPCRAETCAHLQCFDAVFYI^JMNEKKPTWMCPVCDKPAPYDQLIIDGLLSKILSECEDADEIEYLV 
DGSWCPIRAEKERSCSPQGAILVLGPSDANGLLPAPSVNGSGAIiGSTGGGGPVGSMENGKPGADVV 
DLTLDSS SSSEDEEEEEEEEEDEDEEGPRPKRRCPFQKGLVPAC 
>PIASy (prey) 

LVEAKIJl^SFRVSDLQMLLGFVGRSKSGLKHELVTRALQLVQFDCSPELFKKIKELYETRYAKKN 
SEPAPQPHRPLDPLTOmSTVDRAGAVPRTPIAGPNIDYPVLYGKYI^GLGRLPAKTLKPEVRLVKL 
PFFNMLDE LLKPTELVPQNNEKIiQBS PCI FALTPRQVELIRNSRELQPGVKAVQ WLRI CYSDTS C 

PQEDQYPPNIAVKViraSYCSVPGYYPSNKPGVEPKRPCRPINLTHJJWYLSSATNRITVTWGNYGKS 
YS VALYLVRQLTSSEliLQRLKT I GVKHPELCKALVKEKLRLDPDSEI ATTGVRVSLI CPLVKMRLS 

^CRAETCAHLQCFDAVFYUiMNEKKPTWMCPVCDKPAPYDQLIIDGI^SKILSECE^EIEYLV 
IX3SWCPIRAEKERSCSPQGAILVLGPSDANGLLPAPSVNGSGALGSTGGGGPVGSMENGKPGADW 
DLTLDSSSSSEDEEEEEEEEEDEDEEGPRPKRRCPFQKGLVPAC . 
>PLIP 

GE 1 1 EGCRliPVLRRNQDNEDEW PLAE I LS VKDI S GRKLFYVHYIDFNKRXiDEWVTHERLDliKKIQF 
PKKEAKTPTKNGLPGSRPGSPEREVKRKVEVVSPATPVPSETAPASVFPQNGAARRAVAAQPGRKR 
KSNCLGTDEDSQDSSDGIPSAPRMTGSLVSDRSlBDDJVlllMKNIEClElXSRHRLKPWYFSPYPQEIj 
TTLP^YLCEFCI^GRSLKCLQRHLTKCDI^PPGl^IYRKGTISFFEIDGRKNKSYSQNLCLLA 
KCFLDHKTLYYDTDPFLFYVMTEYDCKGFHIVGYFSKEKESTEDYNVACILTLPPYQRRGYGKLLI 
EFSYELSKVEGKTGTPEKPLSDLGLLSYRSYWSQTILEILMGLRSESGERPQITINEISEITsiKK 

EDVISTLQYl^INYYKGQYILTLSEDIVDGHERAMLKRLLRIDSKCLHFTPKDWSKRGKW 
>PTN 

TfS^^^^ PVQ ^ RQBSP ^^^ REG ^ TSESSV «^<5RAAAAQAGAAASRVPGLSGSN 
IAPCI^GRLSAREDVSNSKMQAQQYQQQRRKFAAAFLAPIFIIjAAVDTAEAGKKEKPEKKVKKSDC 
GEWQWSVCVPTSGDCGIjGTREGTRTGAECKQTMKTQRCKIPCNWKKQFGAECKYQFQAWGECDIjNT 

alktrtgslkralhnaecqktvtiskpcgkltkpkpqaeskkkkkegkkqekmld 

>PTPK 

S^INAAI^SYRQPAAFIVTQYPLPNTVKDFWRL^ 

y^cydvaleySss^ 0 ^^ 

>SETBD1 

^STSGLGIKDEGDIKQAKKEDTDDRNKMSV^^ 

^aosnpddvltlssstesegesgtsrkptagqtsatavdsddiqtissgsegddfedkknmtgp 

MKRQ VAVKSTRGFALKSTHGIAl KS TNMASVDKGESAPVRKNTRQF YDGEB S C Y 1 1 DAKLEGNLGR 

y^scspot.fvqnvfvdthdlrfpwvaffaskriragteltwdynyevgsvegkellcccgaiec 

RGRLL • 
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>SH3GL3 

EKI SGAEGTKLDDEFLDMERKI DVTNKWAE ILS KTTEYLQPNPAYRAKL 
GMLNTVSKI ROQVKTTGYPQTBGIiLGDCMLKYGKEIjGEDS TFGNAL I BVGESMKLMA^J^DS^n TM 
VKQTF IDPLQLLQDKDLKE I QHHLKKLEGRRLDYDYKKKRVGI^roE EATOQAVEKF^^^ 



> SUMO- 2 

n^TOSUKUCKAir CERQGLSMRQ IRFRFDGQPINETDTPAQLEMEDEDTIDVPQQQTGGVPESS 
> SUMO- 3 

>TAL1 




AENGK 
>TCPG 



LRGASKEIL. 

LRIDDIVSGHKKKGDDQSRQGGAPDAGQE IWEPIAVKLQTYKTAVBTAVLL 



>VIM 

SPRQRRSI^TTHTHRAIiVRLPSGSQSAPPPPP: 



>VIMc* 



Figure 6 (continued) 
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>ZHX1 

EQTINDLTFDGSPVKEENAEC2AESTEVSSSGI S ISKTPIMKMMKNKVENKRIAVHHNSVEDVPEEK 
I^IKPDI^EIVEOTSSSASESNTSTSIVNRIHPSTAS 

WTPEEVEEAI^QFNGTVHTVPQTI^ 

vtapiaxtvagvpsqnniqksqvpaaqptaetkpataavptsqsvkhetalvnpdsfgiraSctke 

QI^LKVSYLKNQFPHDSEIIIO^ITGLTKGEIKKWFSDTRTOQIWSK^S^s^^ 
liTRREIDAWPTEKTOCSKALKEEtCMEIDESNAGSSKEEAGETSPADESQAPKSGSTGKIC^^TPEOL 
SLRKRGRGRPKGRGRGRPRGRPRGSKRINNWDRGPSIjI KFKTGTAILKDYYLi^^^^jEODLnpr^v 

>ZNF33B 



Figure 6 (continued) 
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>ALEX2 



gagacttccagggcagcggtgcctcctgggacagtggt^Scc^SJ^gSc^Sg 



™™?^ GGGTGTAAC ^ 

GCTAATGGCGGACAGGCTTTCCTGGCAGAGGTCCCTGATTCTGftGGAA^ 
AACATGATCAAOVAAACTGATCC^CACATTAA^AAAAA 

rcGGOAAATCAGTGAGCACGAG 

:cagattaatgaggtgatgcgt 
:gacagacaggccctgaatgag 

DGAGCGACAGCGC CTGGTGGAA 
3GCTGCCTTGGAGGGCTTCCTG 



Figure 6 (continued) 
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>BAIP1 

cggccgcggacgaagatggcgaccgccatgtacttggagcactatc 

ccctgcgaacttcagaggaacttccagctgatgcgagagctggaccagaggacggaagataagaaa 

gcagagattgacatcctggctgcagagtagatctccacggtga 

gtggagcgcctgcagaagatccagaacgcctaca^ 

cagctggccatgcagacctacgagatggtggataaacacattcgaaggcttgatgcagacctggcg 

cgctttgaagcagatctgaaggacaagatggagggcagcgattttgaaagctccggagggcgagg^ 

ttaaaaaaaggc cgg ggt cagaaag aaaaaagagggtcccgggg c cgaggcaggaggacatgagag 

gaagacagaccaaagaaaaagaagcacaaaggagggtctgagttcactgaca 

cacccctctg&tgtgctggacatgcccgtggacccaaacxs^ 

gtctcctatggggagatgattggctgtgacaatcc^ 

tgcgtggaccttaccacgaaacccaaaggaaaatgg 

>BAIP2 

AGCCAGCAGGCCAGCGTGACCATGCACGATGTGGACGCCGAGTCCTTCGAGGTGTTGGTCGACTAC 

TGCTACACGGGTCGTGTGTCTCTCAGTGAGGCCAATGTGCAGCGCCTGTACGCGGCCTCCGACATG 

CTACAGCTGGAATATGTGCGGG^GCCTGTGCCTCCTTCTTAGCCCGACGTCTTGACCTGACCAAC 

TGCACCGCCATCCTCAAGTTTGCAGACGCCTTCGACCATCACAAGCTTCGATCTGAGGCC^ 

TACATAGCTCACAACTTCAAGCAGCTCAG 

CTAACCCTGGCCCAGCTGCTGGCXGTCCTACGCCTGGATAGTCTGGACATAGAGAGTGAGCGGACT 

GTATGCCATGTAGCTGTGC^GTGGCTGGAGGCTGCTGCCAAAGAGCGGGGTCCCAGTC^ 

GTCTTCAAGTGCGTGCGCTGGATGCACTTCACTGAAGAAGATGAGGACTACTTAGAAGGGCTGCTG 

ACCAAGCCCATCGTGAAGAAGTACTGCCTGGACGTTATTGAAGGGGCCCTGCAGATGC 

GACCTGTTGTACAAGTCTCTGGTGCCAGTGCCAAACAGCAGCAGC^ 

CTTGTATCTGCAGCJVGAAAATCGACCCCAGAGACTGGGTATGTGTGCCAAGGAGATGG 

TTTGGACATCCTAGAGATC CCTTTCTC TGCTATG^ 

TCCCCTTTGACCAGCTTTGCTC^CACTAAGACTC 

CATGAC^TCTATCTAGCTGCTCAGCCCAGGAAAGA 

TGGCAGCAACTTGCAGATCGCTTGCTGTGTCGTO 

ATCTACATTTTGGGGGGACGAGACCCTATTACTGGAGTTAAGTTGAAGGAAGTGGAATGCTACAGT 
GTTCAGAGAAACCAGTGGGCATTGGTGGCTCCTGTCCCTCATTCGTTCTATTCCTTTGAACTCATA 
GTGGTTCAGAACTATCTXTATGCTGTCAACAGTAAGCGCATGCTTTGCTATGATCCTAGCCACAAT 

atgtggctgaactgtgcttctcttaaacgtagtgactttcaggaagcatgtgtcttcaatgatgaa 

ATCTATTGTATCTGTGACATCCCAGTC^TGAAGGTCTACAACCaVGCTAGGGGAGAATGGAGGCGG 
ATTAGTAATATTCCTTTGGATTCAGAGACCCACAAC^^ 

cttctcatcacttctacaaccccacaatggaaaaagaaccgagtgacagtgtatgagtatgatact 

AGGGAAGATCAGTGGATTAATATAGGTACCATGTTAGGCCTTTTGCAGTTTGACTCTGGCTTTATT 

TGCCTTTGTGCTCGTGTTTATCCTTCCTGCCTTGAACCTGGTCAGAGTTTTATTACTGAGGAAGAT 

GATGCACGGAGTGAGTCTAGTACTGAATGGGACTTAGATGGATTCAGTGAGCTGGACTCTGAGTCA 

GGAAGTTCAAGTTCTTTTTCAGATGATGAAGTCTGGGTGGAAGTAGCACCTCAGCGAAAT 

GATCAGCAGGGTTCTTTG 
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>BAIP3 




^TATTTTCCCSGTGACl^S^J^ 




gatttgaaagaagataaacctacxsaaaagtttgt™ 

GTGACTCCTGAC^GGTCTGCAA^T^ 
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>CGI-125 

ttcgacgcgtccgcgcggaacttcgcccgcgtctctgggcttttoctctgtcaggctggtggcgtt 

ttggtgtcttcgtttgttatggccgctgctgtcgctatggag^cagatgatgctggaaatcgactt 
cggtttcagttggagttggaatttgtgcaatgtt^ 

caaagaggttacttcaaagacaaagcttttg^ 

CCAGAATATGC<^GTATCTAAAGTACCCT^ 

CACTTCCGAAAGGAGCTGGTGAATGCTCAGTGTGGGAAATTTATTGATGAACAGCAGATTCTACAT 

TGGGAGCACTATTCCCGGAAGCGGATGCGCCTTCAGCAAGCCTO 

AACACATCGGGAAAA 

>CGI-74 

GTAGAGAAAGGACGGGCAAAGAAAAGAGAAGCAGAGGAAGTTTAT CGGAATTCTATGC cagcttcc 

AGTTTTCAGCAGCAGAAACTTCGAGTCTGTGAAGTCl^CTCTGCCTATTTAGGACTTCATGATAAT 

GACAGACGACTGGCTGATCATTTTGGGGGTAAACTGCACCTGGGATTTATTGAAATAAGAGAGAAG 

CTTGAAGAATTAAAGAGAGTCGTAGCTGAGAAGCAGGAGAAAAGAAACCAGGAAGGGCTGAAACGA 

AGAGAAGAGAGAGAGAGAGAAGAAAGGGAGAAGCTGAGGAGGTCCCGATCACACAGCAAGAATCCA 
AAAAGG 

>CLH-17 

ATGGCCCAGATTCTGCCAATTCGTTTTCAGGAGCA^ 

gcaaacattggcttcagtaccctgac^^ 

ggagagcaggcccaggtggtaatcattgatatgaatgacccaagtaatccaattcgaagaccaatt 
tcagcagac^gcgccatc^tgaatccagctaggaaagtaattgcactgaaagctgggaaaactctt 
cagatttttaacattgaaatgaaaagtaaaatgaaggctcataccatgactgatgatgtcaccttt 

TGGAAATGGATCTCTTTGAATAC^ 

GAAGGAGAGTCTCAGCC^GTGAAAATGTTTGATCGCCATTCTAGCCTTGCAGGGTGCCAGATTATC 

AATTACCGTACAGATGCAAAACAAAAGTGGTTACTTCTGACTGGTATATCTGC^CAGCAA^ 
. GTGGTC^AGCTATGCACKZT^^ 

GCTAGCTTTGCACAGTTTAAGATGGAAGGAAATGCAGAAGAATCAACCT 

cggggccaagctggagggaagttacatattattgaagttggc^ 

TTTCCAAAGAAGGCAGTGGATGTCTTCTTTCCTCC^ 

c^gatcagtgaaaagcatgatgtggtc^ 

CTTG.AGACT 
>CLK1 

6ACQCG1^3GGTTCTGC3AACATCTGAATAC3^ACAQACCCCiAAC!AGTACTTTCCGCTGTGTCGAGATG 
TTGGAATCGTTTGAGOVTCATGGTC^CATTTGC^TTGTTTTTGAACTATTGGGACTTAGTACTTAC 
GACTTC^TTAAAGAAAATGGTTTTCTACCATTTCGACTGGATCATATCAGAAAGATGGCATATCAG 
ATATGCAAGTCTGTC^TTTTTTGCACAGTAATAAGTTGACTCACACAGACTTAAAGCCXGAAAAC 
ATCTTATTTGTGCAGTCTGACTACAGAGAGGCGTATAATCCCAAAATAAAACGTGATGAACGC1ACC 
TTAATAAATCCAGATATTAAAGTTGTAGR.CTTTGGTAGTGCAACATATGATGACGAACATCACAGT 
ACATTGGTATCTACARGACATTATAGAGCACCTGAAGTTATTTTAGCCCTAGGGTGGTCCCAACCA 
TGTGATGTCTGGAGCATAGGATGCATTCTTATTGAATACTATCTTGGGTTTACCGTATTTCCAACA 
C^CGATAGTAAGGAGCATTTAGCAATGATGGAAAGGATTCTTGGACCTCTACCAAAACATATGATA 
CAGAAAACCAGGAAACGTAAATATTTTCACC^CGATCGATTAGACTGGGATGAACACAGTTCTGCC 
^CAGATATQTTTCAAQACGCTQTAAACCTCTGAAGG^TTTATGCTTTCTCAAQATGTTGAACAT 

GAGCGTCTCTTTGACCTCATTCAQAAAATGTTGGAGTATGATCCAGCCAAAAGAATTACTCTCAGA 
GAAGCCTTAAAGCATCCTTTCTTTGACCTTCTGAAGAAAAGTATA 
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>DRP-1 



>EP1A 



GTATTGGATTGCCACACGGCTCACATTGCA^^™^I^ T ^ GC< ^ CGGCTAT GCCCCT 
AT G®^CTCXK^GCCCATGTGTGTTCAGa??S^^« TCTGG TGATGOXGCCATTGTTGAT 

^^cagcagcccaggtggtgcagJgg^ 
acctgggtgttccccacctc^ 

TOrCACCAG^CTKTCTOCrSGcSSi.^ 

;™rc<^racTTT<»TAA«»^S^ 



Figure 6 (continued) 



21/42 



>EP1G (prey) 

GCGGCTOGGACCCTGTACACGTATCCTGAAAACTGGAGGGCCTTCAAOGCTCTCATCGCTGCTCAG 

TACAGCGGGGCTCAGGTCCGCGTGCTCTCCGCACCACCCCACTTCCATTTTGGCCAAACCAACCGC 

ACCCCTGAATTTCTCCGCAAATTTCCTGCCGGCAAGGTCCCAGCATTTGAGGGTGATGATGGATTC 

TGTGTGTTTGAGAGCAACGCCATTGCCTACTATGTGAGCAATC^GGAGCTGCGGGGAAGTACTCCA 

GAGGCAGC^GCCCAGGTGGTGCAGTGGGTGAGCTTTGCTGATTCCGATATAGTGCCCCCAGCCAGT 

ACCTGGQTGTTCCCCACCTTGGGCATCATGCACCACAACAAACAGGCCACTGAGAATGCAAAGGAG 

GAAGTGAGGCGAATTCTGGGGCTGCTGGATGCTTACTTGAAGACGAGGACTTTTCTGGTGGGCGAA 

CGAGTGACATTGGCTGACATCACAGTTGTCTGCACCCTGTTGTGGCTCTATAAGCAGGTTCTAGAG 

^^^^ CC ^^^ CCTOTCC ^ TAC ^ CCGCTGGTTC CTCACCTGCATTAACCAGCCCCAG 

TTCCGGGCTGTCTTGGGCGAAGTGAAACTGTGTGAGAAGATGGCCCAGTTTGATGCTAAAAAGTTT 

GCAGAGACCCAACCTAAAAAGGACACACCACGGAAAGAGAAGGGTTCACGGGAAGAGAAGCAGAAG 

CCCCAGGCTGAGCGGAAGGAGGAGAAAAAGGCGGCTGCCCCTGCTCCTGAGGAGGAGATGGATGAA 

TGTGAGCAGGCGCTGGCTGCTGAGCCCAAGGCCAAGGACCCCTTCGCTCACCTGCCCAAGAGTACC 

TTTGTGTTGGATGAATTTAAGCGCAAGTACTCCAATGAGGACaCACTCTCTGTGGCACTGCCATAT 

TTCTGGGAGCACTTTC^TAACMACGGCTGGTCCCTGTGGTACTCAGAGTATCGCTTCCCTGAAGAA 
CTCACTCAGACCTraTGAGCTGC^ 

AAGAATGCCTTCGCCAGTGTCATCCTTTTTGGAACCAACAATAGCAGCTCCATTrCTGGAGTCTGQ 

GTCTTCCGAGGCCAGGAGCTTGCCTTTCCQCTGAGTCCAGATTGGCAGGTGGACTACGAGTCATAC 

ACATGGCGGAAACTGGATCCTGGCAGCGAGGAGACCCAGACGCTGGTTCGAGAGTACTTTTCCTGQ 

GAGGGGGCCTTCCAGCATGTGGGCAAAGCCTTCAATCAGGGCAAGATCTTCAAG 
>FEZ1 

GGCAACTGCTCTGAC^CTGAGATCCATGAGAAAGAAGAGGAAGAGTTCAATGAGAA^ 

GATTCCGGTATCAACGAGGAGCCTCTGCTCACAGCAGATCAGGTAATTGAGGAGATTGAGGAAATG 

ATGCAGAACTCCCCAQACCCTGAGGAAGAAGAGGAGGTTCTGGAAGAAGAGGATGGAGGAGAAACT 

TCCTCCCAGGCAGACTCGGTCCTCCTGCAGGAQATGCAGGCATTGACACAGACCTTCAACAACAAC 

TGGTCCTATGAAGGGCTGAGGCACATGTCTGGGTCTGAGCTGACCGAGCTGCTGGACCAGGTGGAG 

GGTGCCATCCGTGACTTCTCGGAGGAGCTGGTGCAGCAGCTGGCCCGCCGGGACGAGCTGGAGTTT 

GAGAAGGAAGTGAAGAACTCCTTTATCACGGTGCTTATTGAGGTTCAGAACAAGCAGAAGGAGCAG 

CGAGAACTGATGAAA^GAGGCGGAAAGAGAAAGGGCTGAGCCTGCAGAGCAGCCGGATAGAGAAG 

GGAAACCAGATGCCTCTGAAGCGCTTCAGCATGGAAGGCATCTCCAACATTCTGCAGAGTGGCATC 

CGCCA.GACCTTTGGCTCCTCAGGAACTGACAAACA.GTATCTGAACACAGTGATTCCTTACGAGAAG 

AAAGCCTCTCCTCCCTCAGTGGAAGACCTGCAGATGCTGACAAACATTCTCTTTGCCATGAAGGAG 

Q ATAATGAGAAGGTGCCTACTTTGCTAACGGACTACATaTTAA^GTGCTCTGCCCTACC 

ATGGCGTCGAGCGGCGGGGAGCTAGGGAGTTTATTTGATCACCACGTCCAGAGGGCGGTATGCGAC 
ACACGGGCCAAATATCGAGAGGGACGACGGCCTCGTGCTGTGAAGGTAXATACAATCAATTTGGAA 
TCTCAGTACTTATTAATACAAGGAGTTCCTGCTGTGGGAGTCATGAAGGAATTAGTTGAGCGATTC 
GCTTTATATGGTGCAATTGAAGAGTACAATGGTCTAGATGAATACCCAGCAGAAGACTTTACTGAA 
GTTTATCTTATTAAATTTATGAACTTACAAAGTGCAAGGACAGCCAAGAGAAAAATGGATOAACAG 
AGTTTCTTCGGTGGATTGCTTCATGTGTGCTATGCTCC^GAATTTGAAACAGTTGAAGAAACTAGA 
AAAAAACTACAAATGCGGAAGGCATATGTAGTAAAAACTACTGAAAATAAAGACCATTACGTGACA 

aagaagaaattggttacagagcataaagacacagaggattttagacaagacttccactcagagatg 

TCTGGATTTTGTAAAGCTGCTTTGAACACTTCTGCAGGGAACTCAAATCCTTATCTTCCGTATTCC 
^^ TTGOTTTATGCT ^^^ 

GCACCAGACTCCTCTAAGGATGGTAGAAACCATCATAAAACAATGGGGCATTATAACCACAA'rGAC 

tctttgcggaaaacacagataaactctttgaaaaactcagtggcctgccctggtgcacaaaaggct 

A^ACGTCTTC^GAGGCAGTTGACAGATTTATGCCTAGGACAAC^C^CTGCAGGAGCGC^AAGA 
AGAAGAGAAGATGATCGTAAACTTGGAACTTTTCTTCAAACAAACCCAACTGGTAATGAGAT^Ta 
ATTGGACCTCTGTTACCAGACATCTCTAAAGTGGATATGCACGATGACTCATTGAATACAACGGCG 

AApTAATTCGGCATAAACTTAAAGAGGTAATTTCATCTGTGCC^GCCTCCAGAGGACAAGCCA 
GAAGATGTACATACAAGTCATCCATTAAAACAAAGAAGAAGAATA 
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>Q45IP2 



GGCGCCCTGAAGGAGCTCAGcWggCTCA^ 

tt^gtt OT gacctccaa^ 

>G45IP3 

aacacgagaaagc^cataccaS 

AATCCTATTTATTACCTCAGJ^GTTTTT^^CTT 

cctagcccctaccccc<^tta^ 
atgcagggtgcccsgc^g^^ 

GCCGGqGTCTACGAGTCAGCCAAAGT^ 

GCTGQSGGTGAGGAGGACGA^GGCGACATOGCG^TGCAGATCP^^w 

tgctgcgagaacgac^tcc3acatagtg^ 

GCCGGCGAWAGGCGGGTGCGCC^ 
GCCTGGAAGGATCCCGCCTTG^^ 

TGGGTGCCCAGCATCACCCTCCCCGAG UAUl - C;TGTTTTGCG AGGAGAGCCGCAGCGrTAACGAC 
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>GIT1 

CCACAGATGGCTG&CAC^TCTCGG^ 

AAAGCTGCTAAGAAGAAGCTGCAGGCGCTCAGCAACCGGCT^ 
TATGACGAGGTGGATCGAAGAGAAAATGATGCAGTGTGGCTGGCTACCC^^ 

GTGACAGAGCGCAGTGCCGTGCCCTTCCTGCCTGTTAACCCGGAATACTCAGCCACGCGGAATCAG 

GGGCGACAAAAGCTGGCCCGCTTTAATGCCCGAGAGTTTGCCACCTTGATCATCGA 

GAGGCCAAGCGGAGACAGCAGGGCAAGAGCCTGAGCAGCCCCACAGACAACCTCGAGCTGTCTCT 

CGGAGCCAGAGTGACCTCGACGACCAACACGACTACGACAGCGTGGCCTCTG^CGAGGACACAGAC 

CAGGAGCCCCTGCGCAGCACCGGCGCCACTCGGAGCAACCGGGCCCGGAGCATGGACTCCTCGGAC 

TTGTCTGACGGGGCTGTGACGCTGCAGGAGTAC CTGGAGCTGAAGAAGGCCC TGGCTACATCGGAG 

GCAAAGGTGCAGCAGCTCATGAAGGTCAACAGTAGCC^ 

GAGATCCACAAGCTGCAGGCGGAGAACCTGGAGCTCCGGCAGCCTCCAGrGGCCGGTGCCCACACCT 

CCACTCCCCAGTGAACGGGCGGAACACACACCCATGGCGCCAGGCGGGAGCft.GACACCGC^ 

CGCCAGGCCTTTTCCATGTATGAACCTGGCTCTGCCCTGAAGCCCTTTGGGGGCCCCCCTGGGGAC 

GAGCTCACTACGCGGCTGCAGCCTTTCC^C^GCACTGAGCTAQAGGACGACGCCATCTATTCAGTG 

CACGTCCCTGCTGGCCTTTACCGGATCCGGAAAGGGGTGTCTGCCTCAGCTGTGCCCTTCACTCCC 

TCCTCCCCGCTGCTGTCCTGCTCCCAGGAGGGAAGCCGCCACACGAGCAAGCTTTCCCGCCACGGC 

AGTGGAGCCGACAGTGACTATGAGAACACGCAAAGTGGGGACCCACTGCTGGGGCTGGAAGGGAAG 

AGGTTTCTAGAGCTGGGCAAAGAGGAAGACTTCCACCCAGAGCTGGAAAGCCTGGATGGAGACCTA 

GATCCTGGGCTTCCCAGCACAGAGGATGTCATCTTGAAGACAGAGCAGGTCACCAAGAAC^ 

GAACTGTTGCGGGCAGCCCAGGAGTTCAAGCATGACAGCTTCGTGCCCTGCTCAGAGAAGATCCAT 

TTGGCTGTGACCGAGATGGCCTCCCTCTTCCCAAAGAGGCCAGCCCT.GGAGCCAGTGCGGAGCTCA 

CTGCGGCTGCTCAACGCCAGCGCCTACCGGCTGCAGAGTGAGTGCCGGAAGACAGTGCCCCCAGAG 

CCCGGCGCCCCAGTGGACTTCCAGCTGCTGACTCAGCAGGTGATCCAGTGCGCCTATGACATCGCC 

AAGGCTGCCAAGCAGCTGGTCACCATCACCACCCGAGAGAAGAAGGAG 

>hADA3 

AAAGATGTGGATGC CCTGCTGAAGAAGTCTGAGGCCCAGCATGAACAGC CGGAAGATGGATGCCCC 
TTTGGTGCCCTGACGCAGC^CCTCCTGCAGGCCCTGGTGGAGGAAAATATTATTTCCCCTATGGAG 
GATTCTCCTATTCCTGACATGTCTGGQAAAGAATCAGGGGCTGACGGGGCAAGCACCTCCCCTCGC 
AATCAGAACAAGCCCTTCAGTGTGCCGCATACTAAGTCCCTGGAGAGCC 

ATTGCCCAGGGCCTTTTGGAGTCTGAGGACCGCCCCGCAGAGGACTCCGAGGATGAGGTCCTTGCT 
GAGCTTCGGAAACGGCAGGCTGAGCTGAAGGCACTTAGTGCCCACAACCGCACCAAGAAGCACGAC 
CTGCTGAGGCTGGCAAAGGAGGAGGTGAGCCGGCAGGAGCTGAGGCAGCGGGTGCGCATGGCTGAC 
AACGAGGTCATGGACGCCTTTCGCAAGATCATGGCTC 

AAAGAAAAGGACCAGGCCTGGAAGACTCTGAAGGAGCGTGAGAGCATCCTGAAGCTGCTGGATGGG 
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2>HB01 
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>HD1.7 



^^gaatggtgcccctc^^ 

sccgtgcctgactcgaacaagcaag 

rccCAAAATTATGGCTTCTTTTGGC 



„ m „ t'ATGAACTGACQTTACATCATA 
GTGACCaOAGCCCTGGAQCTGTTGCAGCAGCTCTTCAGAACG© 



ICACAATGTT 
•CTGCAAACC 



>HDdl .0 



?CATTGTCTGAO 
^CTTCTGGGCATC 



CGCTATGGAA 
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>HDdX . 3 



>HDexQ51 
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>HIP11 

QTGGACCTTGTC^CCGCCTGTGACATCCGGTACTGTGCCCAGGATGCTTTCTTCCAGGTQAAGQAG 
GTGGA.CGTGGGTTTGGCTGCCGATGTAGGAACACTGCAGCGCCTGCCCAAGGTCATCGGGAACCAG 
AGCCTGGTCAACGAGCTGGCCTTCACCGC C CGGAAGATGATGGCTGACGAGGCCCTGGGCAGTGGG 
CTGGTCAGCCGGGTGTTCCCAGACAAAGAGGTCATGCTGGATGCTGCCTTAGCGCTGGCGGCCGAG 
ATTTCCAGCAAGAGCCCCGTGGCGGTGCAGAGCACCAAGGT 

TCGGTGGCCGAGAGCCTCAACTACGTGGCGTCCTGGAACATGAGCATGCTGCAGACCCAAGACCTC 

GTGAAGTCGGTCCAGGCCACGACTGAGAACAAGGAACTGAAAACCGTCACCTTCTCCAAGCTC 

>HIP13 

CCCTGCTGCTCTGAGGACACCATCCCTTCCCAAGTTTCAGATTATGATTATTTCTCTGTAAGTGGT 

GAC CAGGAGGCAGAT CAG CAGGAGTTCGACAAGTCCTC CACC ATTC CAAGAAACAGCGACATCAGC 

CAGTCCTACCGACGGATGTTCCAAGCCAAGCGTCCAGCCTGAACTGCTGGCCTCCCC^ 

GGACCTGCTATGGTCACTCCAGGGGTTGCAACTATCCQACGGACCCCTTCCACCAAGCCTTCTGTC 

CGCCGGGGAACCATTGGAGCTGGTCCCATCCCCATCAAGACACCCGTGATCCCTGTCAAGACCCCA 

ACCGTCCCAGACCTCCCAGGGGTGTTGCCAGCCCCTCCAGATGGGCCAGAAGAGCGGGGGGAGCAC 

AGCCCTGAGTCGCCATCTGTGGGTGAGGGCCCCCAAGGTGTCACCAGCATGCCCTCCTCAATGTGG 

AGCGGCCAAGCTTCCGTTAACCCTCCACTTCCAGGCCCGAAGCCCAGTATCCCTGAG 

CAGGCAATTCCAGAAAGTGAAGCTGAAGACCAGGAAC 

GGCCAGATTCCAGAGAGTGACCCTGCAGACCTGAGCCCAAGGGATACTCCAGAAGGAGAAGACATG 

CTGAACGCCATCCGAAGGGGCGTGAAACTGAAGAAGACCACGACAAACGATCGCTCAGCCCCTCGC 

TTTTCT 

>HIP15 

ATTCACATGGCTCCACCTTATCC^AATCTAAACATGATTGA 

GAAACCCTTGC^CATAGTGTGGATTCCCTTCAGCAGCTGACTGGAATAAGGATGCTTAGACACCTC 
ACTATGACXATTGACTATCACACACTQATTGCCAACTATATGTCCGGGTTTCTCTCCTTATTAACC 
. ACAGCGAATGCGAGAACGAAGTTTCACGTTCTGAAAATGCTATTGAAT^TTGTCTGAAAATCCTGCT 
GTGGCAAAAAAACTATTCAGTGCCAAAGCTCTTTGAATATTTGTGGGTCTCTT 
ACAAATGATAATATTCAAATTGTTATT^ 

AAGATGTCCTTAATTGATGATGATTTCAGTCTTGAGCCGCTTATTTCTGGATTTCGTGAATTTGAG 
GAGTTAGCTAAGCAACT ACAAG CCCAAATAG ACAAC CAAAATGATCCTGAGGTGGGACAACAAAGT 
>HIP16 

GATGAAGAGGAGAGAAACGATAGGCAGATGATAAAGGAAGCTTTTGCTGGGGATGATGTCATGAGA 
GATTTCTTGAAAGAGAAGAGGGAAGCTGTGGAGGCGAGTAAGCCAAAGGACGTGGACCTGACACTA 
CCTGGCTGGKMCGAGTGGGGTGGTGTGGGCCTAAAGCCCAGTGC 

ATTAAAGCCCCTGAGGGTCCTCCAAGAAAAGATAAGAATTTGCCAAATGTGATTATC^ 
CGCAACATCCACGCAGCTGCTCATCAGGTACGAGTGCTTCCAT 
TTTGAAAGGACCATCCAGACCCCCATAGGATCCACAT 
ACTACTCCCAAGGTCGTCACCAAGCCAGGC<^^ 

TACCGGTCTTCCTCAAGGTCGGACCTGTCTGTCATACAGAGGAATCCAAAACGAATCACCACACGT 

GAGAAAAAACAGCTGAAGAAATGCTCTGTAGAT 

>HIP2 

ATGGCCAACATCGCGGTGCAGCGAATCAAGCGGGAGOT 

AGCAAAAATCAAATTAAAGTAGATCTTGTAGATGAGAATTTTACAGAATTAAGAGGAGAAATAGCA 
GGACCTCCAGACACACCATATGAAGGAGGAAGATACCAACTAGAGATAAAAATACCAGAAACATAC 
CC^TTTAATCCCCCTAAGGTCCGGTTTATC^CTAAAATATGG<^TCCTAATATTAGTTCCGTC^CA 
GGGGCTATTTGTTTGGATATCCTGAJ\AGATCAATGGGCAGCTGCAATGACTCTCCGCACGGTATTA 
TTGTCATTGOkAGCACTATTGGCAGCTGC^ 

CAGTACAAAC AAAAT C CC GAAATGTTCAAACAGACAGCT CGACTTT GGGCACATGTGTATGCTGGA 
GCACCAGTTTCTAGTCCAGAATACACCAAAT^AAATAGAAAACCTATGTGCTATGGGCTTTGATAGG 
AATGCAGTAATAGTGGCCTTGTCTTCAAAATCATGGGATGTAGAGACTGCAACAGAATTGCTTCTG 
AGTAACTG 
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>HIP5 (bait) 




QAAAAACKSTGOVGAAATTCCAAAGACW 
GAAAACAATGCTGAAAACAGT^SCTGAta^S^ 

CAATTC ^^TTCAAAGTGGTGCTGG^f^^^n^r^^^^^ TCAAC AG CATTCXCAA 
A^GACAAAACCCTGGATCTCTAgIS^ 

^TAGTACTTCTGAGOTTTTGATG^r^n^V^^ QQTACTT CTTATATTGAAGAAGTTTCA 
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>HMP 

CAAGAACAAGTTAAAATTGAGTCTCTAGCCAAGAGCTTAGAAGA 

GTCACTCTGCAGGCTATTGCAGCTCAGAATGCTGCGGTC 

TTGAAAGCCGCC^TGGACAATTCTGAGATTGCAGGCGA 

GAGGGTG CATTGAAGGAACG C AGAAAGGCAGT AGATGAAGCTGCC GAT GCCCT TCTCAAAGCCAAA 
GAAGAGTTAGAGAAGATGAAAAGTGTGATTGAAAATGCAAAG 

CCTCATATAACTGCTGCAGAGGGTAAAC TTCACAACATGATAGTTGAT CTGGATAATGTGGTCAAA 

AAGGTCGAAGCAGCTCAGTCTGAGGCTAAGGTTGTATCTGAGTATCATGAGCTGGTGGTCCAAGCT 

CGGGATGACTTTAAACGAGAGCTGGACAGTATTACTCCAGAAGTCCTTCCTGGGTGGAAA 

AGTGTTTCAGACTTAGCTGACAAGCTCTCTACTGATC^ 

CGTCGTATTGATCAGCTGAACAGAGAGCTGGCAGAACAGAAGGCCACC 

TTAGCCTTGGAGAAACAAAAGCTGGAAGAAAAGCGGGCATTTGACTCTGCAGTAGCAAAAGCA 

GAACATCACAGAAGTGAAATACAGGCTGAACAGGACAGAAAGATAGAAGAAGT 

GAAAATGAAATGAGAACCC^GCTTCGCCGACAGGCAGCTGC^ 

CTTAGGGTACAAQAACAGGAATTG7VAGTCTGAATTTGAGCAGAACCTGTCTGAGAAACTCTCTGAA 

GAAGAATTACAATTTCGTCGTCTCAGTCAAGAGCAAGTT 

GCCTATGCCAGACTCAGAGGAATCGAACAGGCTGTTC 

AGAAAAGG C CAC CAACTCTGG CTTTC AGTGGAGGCATTAAAGTACAGC ATGAAGACCTCAT CTGCA 
GAAAC&CCTACTATCCCGCTGGGTAGTGCGGTTGAG 

TTCACCCAAGCTTTAACCGCAGCTATCCCTCCAGAGTCCCTGACCCGTGGGGTGTACAGTGAAGAG 
ACCCTTAGAGCCCGTTTCTATGCTGTTCAAAAACTGGCCCGAAGGGXAGCAATGATTGATGAAACC 
AGAAATAGCTTGTACCAGTACTTCCTCTCCTACCTACAGTCCCTGCTCCTATTCCCACCTCAGCAA 
CTGAAGCCGCCCCCAGAGCTCTGC cctgaggatataaagacatttaaattactgtcatatgctt cc 

TATTGCATTGAGCATGGTGATCTGGAGCTAGCAGGAAAG 
AGACGAGTGGCACAGGACTGGCTGAAGGAAGCCCGAATGACCCT^^ 
ATCCTGACAGCATATGCCAGCGCCGTAGG^^TAGGAACPACTCAGGTGCAGCCAGAG 
>HP2 8 

CCGCCCGCAGACTCTTTGCTCAAGTACGACAC^ 

AGCC C CAAGGCTCGGCTACXGAAAGTCAGCCCCCAGCAG CCTGGACCTTCAGGTTCAGCCCCACAG 
CCACCCAAGACCAAGCTCCCCTGAACTCCCTGTGTCCCAGATCCTACAAAGCAGGCAGAAGAAATC 
TTGAATGCCATACTACCCCC^U^GGGAGTGGGTGGAAGACACGCAGCTATGGATCCAGCAGGTGT 
AGC^CCCCTAGCACCAGGATGGACGTGGTGCACCTC 

CGGCAGGC CAGGGAAACAGGCATCTGCCCTGTC CGCAGGGAACTCTACTCACAGTGTTTTGATGAG 
TTGATCCGGGAGGTCACCATCAACTGTGCGGAGAGGGGGCTGCTGCTGCTGCGAGTCCGGGACGAG 
ATCCGCATGACCATCGCTGCCTACCAGACCCTGTACGAGAGCAGCGTGGCGTTTGGCATGAGGAAG 
GCACTGCAGGCTGAGCAGGGGAAGTCAGACATGGAGAGGAAAATCGC^ 

AGAGACCTGGAGAGGC AAG TGAACGAGCaGAAGGCAAAATGTGAAG C CACTGAGAAG CGGGAGAGC 
GAGAGGCGGCZAGGTGGAGGAGAAGAAGCACAATGAGGAGATTCAGTTCCTGAAGCGAACAAATCA^ 
CAGCTGAAGGCCCAACTGGAAGGCATTATTGCACCAAAGAAG 
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>HSPC232 

CGGCGGCGAGCGGACGGCTGCATTTACQGGQTCTCCCGGAGGGCCAGAGTCGTGQCTTACAGAAGA 
GACGAAATGTGGTCTGAGGGACGATATGAATATGAAAGAATTCCGAGAGAACGAGCACCTCCTCQA 
AGTCATCCCAGTGATGAATCTGGTTATAGATGGACAAGAGACGATCATTCTGCAAGCAGGCAACCT 
GAATACAGGGACATGAGAGATGGCTTTAGAAGAAAAAGTTTCTACTCTrCCCATTATGCGAGAGAG 
CGGTCTCCTTATAAAAGGGACAATACTTTTTTCAGAGAATCACCTGTTGGCCGAAAGGATTCTCCA 
CACAGCAGATCTGGTTCCAGTGTCAGTAGCAGAAGCTACTCTCCAGAAAGGAGCAAATCATACTCT 
TTCCATCAGTCTCAACATAQAAATAAAGAGAGGCCTGTCCAGTCTTTGAAAACATCAAGAGATACT 
TCACCCTCAAGTGGTTCAGCAGTTTCTTCATCAAAGGTGTTAGACAAACCCAGTAGGCTAACTGAA 
AAGGAACTTGCTGAGGCTGCAAGCAAGTGGGCTGCTGAAAAGCTAGAGAAATCAGATGAAAGTAAC 
TTGCCTGAAATTTCTGAGTATGAGGCGGGATCCACAGCACCATTGTTTACTGACCAGCCAGAGGAA 
CCTC^GTCAAACACAACACATGGGATAGAATTATTTGAAGATAGTCAGCTAACCACTCGCTCTAAA 
G ^ AGCATCAAAAACCAAAG ^^ 

GTGGTGAAAATGCTGATTGAAAAAGATCCTTCATTAGAAAAGTCTATACAGTTTGCATTGAG^ 
>HYPA 

GGCCGCCGGCGGAGCAGTCTGAGCCCGACGATGAGGCCGGGGACGGGAGCTGAGCGTGGAGGCCTC 

^TGCCTGGAATGATGTCGTCAGTAATGCCTCSGAATGATGATGTCTCATATGTCTCAGGC^ 
AAATCAATGTGGACTGAACATAAATCACCTGATGGAAGGACTTACTACTACAACACTGAAACCAAA 

cagtctacctgggagaaaccagatgatcttaa^ 

TGGAAGGAATACftAATGAGATTCXGGAAAGCCTTACTATTATAATTCTCAAAC 

A ^ A ?^ TCAAACCTGCAT ^ 

ACAACATC^C^GCTCCAGTCCCTACAACAGAAATTCCGACCACAATGAGC^CCA?^ 

«mcmcmctoct^^ 

A f?^ TTOTTGCTACTGTTO ^^ 

^S^ AC ^ STACCCCTQCTATTCAGGATCAAAGTCT 

A S^™ CAAGA J^^^ 

AAGAA j^ CATACACTOG ^™^ 

A 5^^ A ^^ TC ^ TGC ^ TGGGAGCAGGCT ^ 

AAAGAAAAAAAAAAA/^ 
>HZFH 

GACATGAAGGCGGACGTGACCCGCCTGC<^GCCACGCTGTCCCGAATACC^ 

?™in GTCCGAOCG ^ G ^ TCCTCAGCCGGCTGG ^ 
£Sf^ TACCCGCCGGGTCCC ^^ 

GTAGGGGCCCTGGCCGCCGCAGGCGCCAATTACAGCCAGATGCCTGCAGGGTCCTT 
GGGCTGGATCGGAAGGAGCCCCGAGCCGGGGAGGTGATCTGTATAGACGAC 
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>IKAP 

CTCAAAGAAGGCAGTCCGCTGGAGGACCTGGCCCTCCTGGAGGCA^ 
ACTGAAAACCTGAAAGATGAAGTATACCATATTTTAAAGGTACT 
CAAGGAAGGGAATTAC^GAAGGCCTTTGAAGATACGCT^^ 
ATTTGGACTCTTACTTACCAGCAGAATTCAGCTACCCC^ 

AGTATCATGGCATCTTATCAGCAACAGAAGACTTCGGTTCCTGTTCTTGATGCTGAGCTTTTTATA 

CCACCAAAGATCAACAGAAGAACCCAGTGG^GCTGAGCCTGCTAGAC 

>IMPD2 

GACTTTCTCATTCTCCCTGGGTACATCGACTTCACTGCAGACCAGGTGGACCTGACTTCTGCTCTG 
ACCAAGAAAATCACTCTTAAGACCCCACTGGTTTCCTCTCCCATGGACACAGTCACAGAGGCTGGG 
ATGGCCATAGCAATGGCGCTTACAGGCGGTATTGGCTTC^TCCACCACAACTGTACAC 
CAGGCCAATGAAGTXCGGAAAGTGAAGAAATATGAACAGGGATTCATC^CA 

AGCCCCAAGGATCGCGTGCGGGATGTTTTTGAGGCG^GGC CCGGCATGGTTTCTGCGGTATC CCA 
ATGACAGACACAGGCCGGATGGGGAGCCGCTTGGTGGGCATCATC^ 

CTCAAAGAGGAGGAACATGACTGTTTCTTGGAAGAGATAATGACAAAGAGGGAAGACTTGGTGGTA 
GCCCCTGCAGGCATCACACTGAAGGAGGCAAATGAAATT 

CCCATTGTAAATGAAGATGATGAGCTTGTGGCCATCATTGCCCGGACAGACCTGAAGAAGAATCGG 
GACTACCCACTAGCCTCCAAAGATGCCAAGAAACW 

GAGGATGACAAGTATAGGCTGGACTTGCTCGCCCAGGCTGGTGTGGATGTAGTGGTTTTGGACTCT 
TCCCAGGGAAATTCCATCTTCCAGATCAkTAT 

CAAGTCATTGGAGGCAATGTGGTCACTGCTGCCCAGGCCAAGAACCTCATTGAT 

GCCCTGCGGGTGGGCATGGGAAGTGGCTCCATCTGCATTACGCAGGAAGTGCTGGCCTGTGGGCGG 

CCCCAAGCAACAGCAGTGTACAAGGTGTCAGAGTATGCACGGCGCTTTGGTGTTCCGGTCATTGCT 

GATGGAGGAATCCAAAATGTGGGTCATATTGCGAAAGCCTTGGCCCTTGGGGCCTCCACAGTCATG 

ATGGGCTCTCTCCTGGCTGCCACCACTGAGGCCCCTG'GTGAATACTTCTTTTCCGATGGGATCCGG 

GTAAAGAAATATCGCGGTATGGGTTCTCTCGATGCCATGGACAAGCAGC1?CAGCAGCCAGAA 

TATTTCAGTGAAGCTGACAAAATCAAAGTGGCCC^ 

TCAATCCACAAATTTGTCCCTTACCTGATTGCTGGCATCCAAGACTCATGCCAGG^ 

AAGAGCTTGACCCAAGTCCGAGCCATGATGTACTCTGGGGAGCTTAAGTTTGAGAAGAGAACGTCC 

TCAGCCCAGGTGGAAGGTGGCGTCCATAGCCTCCATTCGTATGAGAAGCGGCTTTTC 

>KPNA2 

GCTTGGGCACTCACTAACATTGCTTCTGGGAC^T 

GCCATCC CAGCATTCATTTCTCTGTTGGCATCTCCCCATGCT CACATCAGTGAACAAGCTGTCTGG 
GCTCTAGGAAACATTGCAGGTGATGGCTCAGTGTTCCGAGACTTGGTTATTAAGTACGGTGCAGTT 
GACCCACTG1TGGCTCTCCTTGCAGTTCCTGATATGTCATCTTTAGCATGTGGCTACTTACGTAAT 
CTTACCTGGACACTTTCTAATCTTTGCCGCAACAAGAATCCTGCAC 

CAGATTCTTCCTACCTTAGTTCGGCTCCTGCATC^TGATGATCCAGAAGTGTTAGCAGATACCTGC 
TGGGCTATTTCCTACCTTACTGATGGTCCAAATGA^ 

GTGCCCCAACTTGTGAAGCTTCTAGGAGCTTCTGAATTGCCAATTGTGACTCCTGCCCTAAGAGCC 
ATAGGGAATATTGTCACTGGTACAGATGAACAGACTGAGGTT^ 

GTCTTTCCCAGCCTGCTCACCAACCCC^VAAACTAACATTGAGAAGGAAGCTACGTGGACAATG 

AACATCACAGCCGGCCGCCAGGACCAGATACAGCAAGTTGTGAATCATGGATTAGTCCCAT 

GTCAGTGTTCTCTCTAAGGCAGATTTTAAG 

ACCAGTGGTGGAACAGTTGAACAGATTGTGTACCTTGTTGACTGTGGCATAATAGAACCGTTGATG 
AACCTCTTAACTGC3VAAAGATACCAAGATTACT 

CAGGCTGCTGAGAAACTAGGTGAAACTGAGAAACTTAGTATAATGATTGAAGAATG'TGGAGGCTTA 
GACAAAATTGAAGCTCTACAAAACCATGAAAATGAGTCTGTGTATAAGGCTTCGTTAAGCTTAAOT 
GAGAAGTATTTCTCTGTAGAGGAAGAGGAAGATCAAAACGTTGTACCAGAAACTACCTCTGAAGGC 
TACACTTTCCAAGTTCAGGATGGGGCTCCTGGGACCTTTAACTTT 
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>KPNB1 

^ATCTGAATGAGCTAA^SGA^ 

ATAGGGGACTTATGTAC3iGCATTTGGGAAGGATGTACTGAAATTAGTAGAAGCTA^G 
>Ku70 

™™°^ TCTGGAGAAG ^^ 



A ^ GA ^ Gc r^ rGAGAAccccGTGcTG 



>LUC7B1 • 



CTTGAGAGCTCCAACGGGAAGATGGCT 
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>MAPllc3 

CAGCGGCGC^GCTTCGCCGACCGCTGTAAGGAGGTACACX^GATCCGCGACCAGCACCCCAGCAAA 

ATCCCGGTGATCATCGAGCGCTACAAGGGTGAGAAGCAGCTGCCCGTC CTGGACAAGAC CAAGTTT 

TTGGTCCCGGACCATGTC^ACATGAGCGAGTTGGTCAAGATCATCCGGCGCCGCCTGCAGCTGAAC 

CCCACGCAGGCCTTCTTCCTGCTGGTGAACCAGCACAGCATGGTGAGTGTGTCCACGCCCATCGCG 

GACATCTACGAGCAGGAGAAAGACGAGGACGGCTTCCTCTATATGGTCTACGCCTCCCAGGAAACC 

TTCGGCTTC 

>TtlHAPl 

CCGAAAGAGCAGGTGCAGAGCGGTGCGGGAGACGGGACAGGGTCGGGGGACCCAGCAGCAGGCACC 

CCCACGACCCAGCCTGCAGTTGGTCCCGCTCCGGAGCCCTCGGCGGAGCCCAAACCTGCTCC^ 

CAGGGAACCGGGTCCGGAC^AAAATCAGGATCCCGAACCAAGACAGGAAGCTTTTGTCGGTCCATG 

ATCATTGGTGATTCGGACGCACCATGGACCCGCTACGTATTCGAGGGGCCTTACGGTCCCCGGGCC 

ACTGGCCTGGGCACTGGAAAGGCCGAGGGAATCTGGAAGACACCAGCCGCGTACATCGGCCGGAGG 

CCCGGCGTGTCCGGCCCTGAGCGTGCGGCGTTTATTCGAGAGCTGCAGGAAGCGTTGTGTCCTAAT 

CCACCACCCACGAAGAAGATCACCGAAGATGATGTCAAAGTGATGTTGTATTTGCTGGAAGAGAAA 

GAACGGGACCTGAACAGAGCCGCCCGGATCGGCGAGTCCCT^ 

GAGGAGAATAATAAGCTGGAAACCATGCTGGGCTCAGCCAGGGAGGAGA^ 

CAGGTGAACCTGCGAGATGACCTTCTTCZAGCTCTACTCAGACTCTGATGAGGATGATGATGAGGAA 
GACXSAGGAAGACGAGGAAGAGGGCGAAGAGGAGGAACGAGAAGGACT^ 

CACGACCAC CCCTATGGTGCCCCCAAGCCACACCCTAAGGCTGAGACAGCGCACCGCTGC CCACAG 

CTGGAAACCCTGCAGCAGAAGCTCAGGCTTCTGG^ 

TCCCACCTTGACAACCTGGAGGACGAAGAGC^GATGCTC^ 

GAAGCCAGCCAGCAGATGGCAGAGCTATCGGAAGTGCTGGTGTTGAGGCTGGAAGGCTATGAGAGG 

CAGGAGAA^GAGATCACTCAGCTGCAGGCCGAGATC^CCAAGCTACAACAGCG 

GGGGCCCAGACGGAGAAACTGGAGCAGATGCTGGCCTCAGAGAAGGGGATCCACTCGGAGAGCCTG 

CGAGCTGGCTCCTACATGCAGGATTATGGGAGCAGGCCTCGTGACCGCCAGGAGGATGGGAAGAGT 

CATCGCCAGCGCTCCTCCATGCCCGGAGGCTCTGTCACCCACTATGGATACAGTGTGCCTCTGGAT 

GCACTTCCAAGTTTCCCAGAGAC^CTGGCTGAGGAGCTCCGAACATCTCTGAGGAAGTTCATCACT 
GACCCTGCGTATTTCATGGAGAGACGTGACACTC^^ 

GCGATGCCACCCCCACCGGCTCA 

>mp53 

GTCACCGAGACCCCTGGGCCAGTGGCCCCTGCCCCAGCCACTCCATGGCCCCTGTCATCTTTTGTC 
CCTTCTCAA?UVAACTTACCAGGGC^ 

AAGTCTGTTATGTGC^CGTACTCTCCTCCCCTCAATAAGCTATTCTGCC^GCTGGCGAAGACGTGC 

CCTGTGCAGTTGTGGGTCAGCGCCACACCTCCAGCTGGGAGCCGTGTCCGCGCCATGGCCATCTAG 

AAGAAGTCACAGCACATGACGGAGGTCX^TGAGACGCTGCCCCCACCATGAGCGCT 

GATGGCCTGGCTCCTCCCCAGCATCTTATCCGGGTGGAAGGAAATTTGTATCCCGAGTATCTGGAA 

GACAGGCAGACTTTTCGCCACAGCGTGGTGGTACCTTATGAGCCACCCGAGGCCGGCTCTGAGTAT 

ACGACCATCC^CTACAAGTACATGTGTAATAGCTCCTGCATGGGGGGCATGAACCGCCGACCTATC 
CTTACCLATCATCACACTGGAAGACTCC^^ 

GTTTGTGCCTGCCCTGGGAGAGACCGCCGTACAGAAGAAGAAAATTTCCGCAAAAAGGAAGTCCTT 
TGCCCTGAACTGCCCCCAGGGAGCGCAAAGAGAGCGCTGCCCACCTGCACAAGCGCCTCTCCCCCG 
CAAAAGAAAAAACCACTTGATGGAGAGTATTTCACCCTCAAGATCCGCX3GGCGTAAACGCTTCGAG 
ATGTTCCGGGAGCTQAATGAGGCCTTAGAGTTAAAGGATGCCCATGCTACAGAGGAGTCTGGAGAC 
AGC^GGGCTCACTCCAGCTACCTGAAGACCAAGAAGGGCCAGTCTAGTTCCCGCCATAAAAAAACA 
ATGGTCAAGAAAGTGGGGCCTGACTCAGAC 
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>NAG4 

CGAGACCGGGTGGAGAATGAGGCAGAAAAAQATCTCCAGTGTCACGCCCCTGTGAGATTAOACTTa 
CCTCCTGAGAAGCCTC^C^CAAGCTCTTTAGCCAAACAAGAAGAAGTAGAACAGACACCCCTTCAA 

gaagctttgaatcaactgatgagaouvttgcagagaaaagatccaagtgctttcttttcatttcct 

GTGACTGATTTTATTGCTCCTGGCTACTCCATGATCATTAAACACCCAATGGATTTTAGTACGATG 

AAAGAAAAGATCAAGAAC^TGACTATCAGTCCATAGAAGAACTAAAGGATAACTTCAAACTAATG 

^^?^^^ T ^ TTTA ^ T ^ CCA QAGACCATTTATTATAAAGCTGCAAAGAAGCTGTT6 

CACTCAGGAATGAAAATTCTTAGCCAGGAAAGAATTCAGAGCCTGAAGCAGAGCATAGACTTCATG 

GCTGACTTGCAGAAAACTC^AAAGCAGAAAGATGGAACAGACACCTCACAGAGTGGGGAGGACGGA 

GGCTGCTGGCAGAGAGAGAGAGAGGACTCTGGAGATGCCGAAGCACACGCCTTCAAGAGTCCCAGC 

AAAGAAAATAAAAAGAAAGACAAAGATATGCTTGAAGATAAGTTTAAAAGCAATAATTTAGAGAGA 
^AGC^^TTGACCGCATCGT 

AGTCAGTGCX3AATTTGAAAGAAGAAAACCAGATGGAACAACGACGTTGGGACTTCTCCATCCTGTG 

GATGATTCTGATTTAATCTATTCAACCTATGGGGAA.GACTCTGATCTTCCAAGTGATTTCAGCATC 
CATGAGTTTTTGGCCACGTGCC^C^TTA^^ 

ACAAAAGGAGGGCATTCCAGGACCCTACAAGAGATGGAGATGTCATTGCCTGAAGATGAAGGCCAT 
ACTAGGACACTTC^CACAGCAAAAGAAATGGAGAra^ 
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>NEFL 

CTCTCTCCCCTGTCCTCTCTCTCCGGGCTCCCA^ 



:cgcc 



GCGCGCTATGAAGAGGAGGTGCTGAGCCGCra^SSSS^^^ A ^<^CCCTGCGCAACCTGCAG 
GTOCTGACCGAGAGCGCCGCC^^ 




gagaatctccgcaaqaaaggc^gcctcaccaSaS?™^ 
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>PFM2 

GCTCCTCGCCGTCCGCOCIXXAGTGCGAAGGGCTCQAAjGATGGCCGGTTGGCAGAGCTACQTGQAT 

AACCTGATGTGCGATGGCTGCTGCCAGGAGGCCGCCATTGTCGGCTACTGCGACGCCAAATACGTC 
TGC«CAGCCACGGCCGGGCXK:GTCTTTCAGa^ 

GCCGGCGCTGTGCCCAC^CTCOSCTGGCAGGCrc^ 
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>PIASy(prey) 



rc: 

CCGTTCTTTAATATGCTGGAT^^ 



lGCCAGAAGTCC 
ATTAGTCCCAC 

GAACTGCAGCCCGGAGTTAAAGC CGTO^GGTCGTCC^QA^^^^^^^^ T ^^^'^^^' ^^&GG 
CCTCAGGAGGACCAGTACCOKC^C^^ 
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TTGAGTCAAAGGCAGGATCAGGTTCCCCGCCTTCCAGTCCAAAAATCCCGCCA 
GCAGAGGAAAATCCAAAGTG GAGAGAGGGGAAGAAAGAGACCAGTGAGTCAT CCGTC CAGAAGG CG 
GGGAGAG CAGCAGCGGCC CAAGCAGGAGCTGCAGCGAGC CGGGTACCTGGACT C AGCGGTAG CAAC 
CTCGCCCCTTGCAACAAAGGCAGACTGAGCGCCAGAGAGGACG 
CAAC^GTACCAGCAGCAGCGTCGAAAATTTGCAGCTGCOT 

GCTGTGGATACTGCTGAAGCAGGGAAGAAAGAGAAACCAGA7U^AAAAAGTGAAGAAGTCTGACTGT 
GGAGAATGGCAGTGGAGTGTGTGTGTGCCCACCAGTGGAGACTGTGGGCTGGGCACACGGGAGGGC 
ACTCGGACTGGAGCTGAGTGC^AGCAAACCATGAAGACCCAGAGATGTAAGATCCCCTGCAACTGG 
AAGAAGGAATTTGGCGCGGAGTGCAAATACCAGTTCC^GGCC^ 

GCCCTGAAGACCAGAACTGGAAGTCTGAAGCGAGCCCTGCACAATGCCGAATGCCAGAAGACTGTC 

ACCATCTCC^UVGCCCTGTGGCyUUlCTGACGAAGCCC^^CCTC^G^ 

AAGGAAGGCAAGAAACAGGAGAAGATGCTGGAT 

>PTPK 

AGTAACTACATGAATGCTGCTC XTATGGACAGCTACAGGCAACGAGCTGCTTT CATCGTCACACAA 
TACCCTCTGCCAAACACTGTAAAAGACTTCTGGAGATTAGTGTATGATTATGGCTGTACCTCCATT 
GTGATGTTAAACGAAGTCGACTTGTCCCAGGGGTGCCCTCAGTACTGGCCAGAGGAAGGGATGCTA 
CGATATGGCCCCATCC^GTGGAATGTATGTCTTGTTCAATGGACTGTGATGTGATCAACCGGATT 
TTTAGGATATGCAATCTAACAAGAC C ACAGGAAGGTTATCTGATGGTGCAACAGTTT CAGTACCTA 
GGATGGGCTTCTCATCGAGAAGTGCCTGGATCCAAAAGGTCATTCTTGAAAGTGATACTTCAGGTG 
GAAAAGTGGCAGGAGGAATGCGAGGAAGGGGAAGGCCGGACGATTATCCACTGCCTAAATGGTGGC 
GGGGGAAGTGGCATGTTCTGTGCTATAGGCATCGTTGTTGA7VATGGTGAAACGGCAAAATGTTGTC 
GATGTTTTCGATGC^I3TAAAGACACTGAGGAACAGCAAGCGAAAC^ 

TACCGTTTCTGCTATGATGTAGCTTTGGAGTACCTGGAATCATCT ' 
>SETBD1 

• AAGGCCTCCACCTCAGK3ACTAGGCATCAAGGATG^ 

ACTGACGACCGAAACAAGATGTCAGTAGTTACTGAAAGCTCTCGAAATTACGGTTACAATCCTTCT 
CCTGTGAAGCCTGAAGGACTTCGCCGCCCACCTAGTAAGACTAGTATGCATCZIAAGCCGAAGACTC 
ATGGCTTCTGCTCAGTCCAACCCTGATGATGTCCTGACACTGTCCAGCAGCACAGAAAGTGAGGGG 
GAAAGTGGGACCAGCCGAAAGCCCAGTGCTGGTCAGACTTCGGCTACAGCGGTTGACAGTGATGAT 

AT^GCGTCAAGTGGCAGTAAAATCAACCCG^ 

ATTAAATCAACCAAC7^TGGCCTCTGTGGACAAGGGGGAGAGCGCACCTGTTCGTAAGAACAGACGC 

TACCTCAACCACAGTTGCAGCCCCAACCTGTTTGTCCAGAATGTCTTCGTGGATACCCATGM? 
CGCTTCCCCTGGGTGGCCTTCTTTGCCAGCAAAAGAATCCGGGCTGGGACAGAACTTACTTGGGAC 
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>SH3GL3 

CCAGACGAAG^GTCAGACAAGCGGTAGAA^^ 

ATGTTTAACTTTTTAGAA^S^SSa^™? ^^ G T<=AAAGGAGTTGGCTGAAAGAAGC 




> sumo -3 

CCCTCC3TCCACCGCTGCCGCCTCCTTCTTCTGCCGCTCC 




GGTTCTGTGGTGCAQ^TCrAA^TTAAGAGGCATACAC 
>TAL1 

TCCCTOATCCTGGCCGCAGCACAGATGCCCGC?^^ 

c^ccggaagctgggcgggtcacaagaggacSgIct 

tttgataaagatgcgatggtggcSg^ 

a<^^ccgaattcttat2^^^ 

q aggagc^gcacggcatccactgcaaca^c^ 
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>TCPG 

CAGACTGACATTGAGATTACACGAGAGGAGG^CTTCACC CGAATT CTCCAGATGGAGGAAGAGTAC 
ATCCAGKTAGCTCTGTGAGGACATTATCCAACTGAAGCCCGATGTGG 

TCAGATTT AGCT CAGCACTACCTTATGCGGG CCAATATCACAGC CATC CGCAGAGTCCGGAAGACA 
GAC^TAATCGCATTGCTAGAGCCTGTGGGGCCCGGATAGTCAGCCGACCAGAGGAACTGAGAGAA 
GATGATGTTGGAACAGGAGCAGGCCTGTTGGAAATCAAGAAAATTGGAGATGAATACTTTACTTTC 
ATCACTGACTGCAAAGACCCCAAGGCCTGCACCATTCTCCT^ 

TCGGAAGTAGAACGCAACCTCCAGGATGCCATGCT^GTGTGTCGCAATGTTCTCCTGGACCCTCAG 
CTGGTGCCAGGGGGTGGGGCCTCCGAGATGGCTGTGGCCCA.TGCCTTGACAGAAAAATCCAAGGCC 
ATGACTGGTGTGGAACAATGGCCATACAGGGCTGTTGCCCAGGCCCTAGAGGTCATTCCTCGTACC 
CTGATCCAGAACTGTGGGGCCAGCACCATCCGTCTACTTAC^ 

GAGAACTGTGAGACCTGGGGTGTAAATGGTGAGACGGGTACTTTGGTGGACATGAAGGAACTGGGC 

ATATGGGAGCCATT GGC TGTGLAAG CTGCAGACTTATAAGACAGC AGTGGAGACGGCAGTTCTGCTA 

CTGCGAATTGATGACATCGTTTCAGGCCACAAAAAGAA^ 

GCTCCTGATGCTGG CCAGGAG 

>VIM 

TCCCCGCGCC^AGACGCAGCCGCGCTCCC^CC^CCCACACCCACCGCGCCCTCGTTCGCCTCTTC 
TCCGGGAGCCAGTCCGCGCCACCGCCGCCGCCC^GGCt^ 

AGGTCCGTGTCCTCGTCCTCCTACCGCAGGATGTTCGGCGGCCCGGGCACCGCGAGCCGGCCGAGC 
TC CAGCCGGAGCTACGTGACTACGTCCACCCGCACCTACAGCCTGGGCAGCGCGCTGCGC CCCAGC 
ACCAGCCGGAGCCTCTACGCCTCGTCCCCGGGCGGCGTGTATGCCACGCGCTCCTCTGCCGTGCGC 
CTGCGGAGeAGCGTGCCCGGGGTGCGGCTCCTGCAGGACTCGGTGGACTTCTCGCTGGCCGACGCC 
ATC^CACCGAGTTCAAGAACACCCGCACCAACGAGAAGGTGGAG CTG CAGGAGCTGAATGAC CGC 

TTCGCCAACTACATCGACAAGGTGCGCTTCCTGGAGCAGCAGAATAAGATCCTGCTGGCCGAGCTC 
GAGCAGCTCAAGGGCCAAGGCAAGTCG CGC CTGGGGGACCTCTACGAGGAGGAGATGCGGGAGC tg 

• CGCCGGCAGGTGGACCAGCTAACCAACGACAAAGCCCGCGTCGAGGTGGAGCGCGACAACCTGGC C 
C^GGAC^TCATGCGCCTCCGGGAG&A^^ 

ACCCTGCAATCTTTC^GACAGGM^ 

GTGGAATCTTTGCAA.GAAGAGATTGCCTTTTTGAAGAAACTCCACGAAGAGGAAATCCAGGAGCTG 
CAGGCTGAGATTCAGGAACAGCATGTCCAAATCGATGTGGATGT^ 

GCCCTGCGTGACGXACGTCAGCAATATGAAAGTGTGGCTGCCAAGAACCTGCAGGAGGCAGAAGAA 

TGGTACAAATCCAAGTTTGCTGACCTCTCTGAGGCTGCCAACCGGAACAATGACGCCCTGCGCCAG 
GCAAAG<^GGAGTCC^CTGAGTACCGGAGAC^GGTGC^^ 

AAAGGAACCAATGAGTCCCTGGAACGCCAGATGCGTGAAATGGAAGAGAACTTTGCCGTTGAAGCT 

GCTAACTACCAAGACACTATTGGCCGCCTGCAGGATGAGATTCAGAATATGAAGGAGGAAATGGCT 

CGTCACCTTCGTGAATACCAAGACCTGCTCAATGTTAAGATGGCCCTTGACATTGAGATTGCCACC 

TACAGGAAGCTGCTGGAAGGCGAGGAGAGCAGGATTTCTCTGCCTCTTCCAAACTTTTCCTCCCTG 
AACCTGAGGGxAAACTAATCTGGATTCACTCCCTCT^ 

ATTAAGACGGTTGAAACTAGAGATGGACAGGTTATCAACGAAACTTCTCAGCATCACGATGACCTT 
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>VIMc 

TTGAAGAAACTCCACGAAGAGGAAATC ^GQAGCTcSG^TCAS^CAGQ^CAnr ar^^^ 
ATGCGTGAAATGGAAGAGAACTTTTCCCOTTGAA^CT 

cag<^tc^gattc*c^ T atg^^ 

AATGTTAAGATGGCCCTTGACATTGAGATTGCCArCTAra^R^ 
AGGATTTCTCTGCCTCTTCCA^^^ 

GTTATCAACGAAACTTCTCA^^ 
>2HX1 

^^^^^^^^^ 

^ctggcagaattaaaagttagctacctt^^ 
< ™toaaaataacaggcctgac<5aaagSgag^ 
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>ZNF33B 

TGTTATGAATGTGGGAAAAC CTTCTG CT TGLAAGTC AGACCTC^CAATACATC AGAGAACGCACACA 

ggggagaaaccctttgcatgtcctgaatgtgggaaattctttagccataagtcaaccctctctc^a 
cattatagaacac^cacgggggagaaaccctatgaatgtcatgaatgtggaaaaa 
aaatcatacctaacaaaacataatagaacacatacaggggagaaaccctatgaatgtaa 
ggaaaaaccttctgccagaagtcacaactcactcagcat 

tatgaatgtaatgagtgtggaaaagctttctgccataagtcagctctaattgtacatcagagaacc 
catacacaagaaaagccttataaatgtaacgaatgtggaaaatctttctgtgtgaagtcaggactt 
attttacatgagagaaagcacacgggggagaaaccctatgaatg caatgaatgtgggaaatc ctt c 
agtcacaaatc^tcactcacagtacattacagggct 

gaatgtggaaaaatc ttttaccgt aaatcagac cttgctaaacatcagagatcac atacaggggaa 
aagccctatgaatgtaacacatgcaggaaaaccttctctcaaaagtcaaatctcattgtacatcag 
agaacacacataggagaaaaaccttatgaa 
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